MANIPULATION OF PROTOPORPHYRINOGEN OXIDASE ENZYME ACTIVITY IN EUKARYOTIC ORGANISMS



The invention relates generally to the manipulation of the enzymatic activity 
responsible for the conversion of protoporphyrinogen IX to protoporphyrin IX in a 
biosynthetic pathway common to all eukaryotic organisms. In one aspect, the invention
is applied to the development of herbicide resistance in plants, plant tissues and seeds.
In another aspect, the invention is applied to the development of diagnostics and
treatments for deficiencies in this enzymatic activity in animals, particularly humans.

The biosynthetic pathways which lead to the production of chlorophyll and heme
share a number of common steps. Chlorophyll is a light harvesting pigment present in
all green photosynthetic organisms. Heme is a cofactor of hemoglobin, cytochromes,
P450 mixed-function oxygenases, peroxidases, and catalases (see, e.g., Lehninger,
Biochemistry, Worth Publishers, New York (1975)), and is therefore a necessary
component for all aerobic organisms. 

The last common step in chlorophyll and heme biosynthesis is the oxidation of 
protoporphyrinogen IX to protoporphyrin IX. Protoporphyrinogen oxidase (referred to 
herein as "protox") is the enzyme which catalyzes this last oxidation step (Matringe et 
al., Biochem. J. 260:231 (1989)).

The protox enzyme has been purified either partially or completely from a number of 
organisms including the yeast Saccharomyces cerevisiae (Labbe-Bois and Labbe, In
Biosynthesis of Heme and Chlorophyll, E.H. Dailey, ed. McGraw Hill: New York, pp.
235-285 (1990)), barley etioplasts (Jacobs and Jacobs, Biochem. J. 244:219 (1987)),
and mouse liver (Dailey and Karr, Biochem. 26: 2697 (1987)). Genes encoding protox 
have been isolated from two prokaryotic organisms, Escherichia coli (Sasarman et al.,
Can. J. Microbiol. 39: 1155 (1993)) and Bacillus subtilis (Dailey et al., J. Biol. Chem.
269:813 (1994)). These genes share no sequence similarity; neither do their predicted
protein products share any amino acid sequence identity. The E. coli protein is
approximately 21 kDa, and associates with the cell membrane. The B. subtilis protein
is 51 kDa, and is a soluble, cytoplasmic activity.

Presently, too little is known about the protox enzyme to allow isolation of protox 
encoding genes from higher eukaryotic organisms (i.e. animals, plants and all other 
multicellular nucleate organisms other than lower eukaryotic microorganisms such as 
yeast, unicellular algae, protozoans, etc.) using known approaches. 

In particular, many of the standard techniques for isolation of new proteins and 
genes are based upon the assumption that they will be significantly similar in primary 
structure (i.e. amino acid and DNA sequence) to known proteins and genes that have 
the same function. Such standard techniques include nucleic acid hybridization and 
amplification by polymerase chain reaction using oligonucleotide primers corresponding 
to conserved amino acid sequence motifs. These techniques would not be expected to 
be useful for isolation of eukaryotic protox genes using present structural information 
which is limited to prokaryotic protox genes since there is no significant structural 
similarity even among the known prokaryotic protox genes and proteins. 

Another approach that has been used to isolate biosynthetic genes in other 
metabolic pathways from higher eukaryotes is the complementation of microbial 
mutants deficient in the activity of interest. For this approach, a library of cDNAs from 
the higher eukaryote is cloned in a vector that can direct expression of the cDNA in the 
microbial host. The vector is then transformed or otherwise introduced into the mutant 
microbe, and colonies are selected that are phenotypically no longer mutant. 

This strategy has worked for isolating genes from higher eukaryotes that are 
involved in several metabolic pathways, including histidine biosynthesis (e.g. U.S.
patent no. 5,290,926 and WO 94/026909 to Ward et al., incorporated by reference herein),
purine biosynthesis (e.g. Aimi et al., J. Biol. Chem. 265: 9011 (1990)), and tryptophan
biosynthesis (e.g. Niyogi et al., Plant Cell 5: 1011 (1993)). However, despite the
availability of microbial mutants thought to be defective in protox activity (e.g. E. coli
(Sasarman et al., J. Gen. Microbiol. 113:297 (1979)), Salmonella typhimurium (Xu et
al., J. Bacteriol. 174:3953 (1992)), and Saccharomyces cerevisiae (Camadro et al.,
Biochem. Biophys. Res. Comm. 106: 724 (1982))), application of this technique to
isolate cDNAs encoding eukaryotic protox enzymatic activity is at best unpredictable
based on the available information.

There are several reasons for this. First, the eukaryotic protox cDNA sequence may 
not be expressed at adequate levels in the mutant microbe, for instance because of 
codon usage inconsistent with the usage preferences of the microbial host. Second, 
the primary translation product from the cloned eukaryotic coding sequence may not 
produce a functional polypeptide, for instance if activity requires a post-translational 
modification, such as glycosylation, that is not carried out by the microbe. Third, the 
eukaryotic protein may fail to assume its active conformation in the microbial host, for 
instance if the protein is normally targeted to a specific organellar membrane system 
that the microbial host specifically lacks. This last possibility is especially likely for the 
plant protox enzyme, which is associated in the plant cell with organelles not present in 
microbial hosts used in the complementation assay. In particular, the plant protox 
enzyme is associated with both the chloroplast envelope and thylakoid membranes 
(Matringe et al., J. Biol. Chem. 267:4646 (1992)), and presumably reaches those
membrane systems as a result of a post-translational targeting mechanism involving
both an N-terminal transit sequence, and intrinsic properties of the mature polypeptide
(see, e.g., Kohorn and Tobin, Plant Cell 1: 159 (1989); Li et al., Plant Cell 3: 709 (1991);
Li et al., J. Biol. Chem. 267: 18999 (1992)).

The protox enzyme is known to play a role in certain human disease conditions. 
Patients suffering from variegate porphyria, an autosomal dominant disorder 
characterized by both neuropsychiatric symptoms and skin lesions, have decreased
levels of protox activity (Brenner and Bloomer, New Engl. J. Med. 302: 765 (1980)).
Due at least in part to the lack of knowledge regarding the human protox enzyme and its
corresponding gene, options for diagnosing and treating this disorder are presently very
limited. 

The use of herbicides to control undesirable vegetation such as weeds or plants in 
crops has become almost a universal practice. The relevant market exceeds a billion
dollars annually. Despite this extensive use, weed control remains a significant and
costly problem for farmers. 

Effective use of herbicides requires sound management. For instance, time and 
method of application and stage of weed plant development are critical to getting good 
weed control with herbicides. Since various weed species are resistant to herbicides, 
the production of effective herbicides becomes increasingly important. 

Unfortunately, herbicides that exhibit greater potency, broader weed spectrum and 
more rapid degradation in soil can also have greater crop phytotoxicity. One solution 
applied to this problem has been to develop crops which are resistant or tolerant to 
herbicides. Crop hybrids or varieties resistant to the herbicides allow for the use of the 
herbicides without attendant risk of damage to the crop. Development of resistance can 
allow application of a herbicide to a crop where its use was previously precluded or 
limited (e.g. to pre-emergence use) due to sensitivity of the crop to the herbicide. For 
example, U.S. Patent No. 4,761,373 to Anderson et al. is directed to plants resistant to 
various imidazolinone or sulfonamide herbicides. The resistance is conferred by an 
altered acetohydroxyacid synthase (AHAS) enzyme. U.S. Patent No. 4,975,374 to
Goodman et al. relates to plant cells and plants containing a gene encoding a mutant
glutamine synthetase (GS) resistant to inhibition by herbicides that were known to
inhibit GS, e.g. phosphinothricin and methionine sulfoximine. U.S. Patent No.
5,013,659 to Bedbrook et al. is directed to plants that express a mutant acetolactate
synthase which renders the plants resistant to inhibition by sulfonylurea herbicides.
U.S. Patent No. 5,162,602 to Somers et al. discloses plants tolerant to inhibition by
cyclohexanedione and aryloxyphenoxypropanoic acid herbicides. The tolerance is
conferred by an altered acetyl coenzyme A carboxylase (ACCase).

The protox enzyme serves as the target for a variety of herbicidal compounds. The
herbicides that inhibit protox include many different structural classes of molecules
(Duke et al., Weed Sci. 39: 465 (1991); Nandihalli et al., Pesticide Biochem. Physiol.
43: 193 (1992); Matringe et al., FEBS Lett. 245:35 (1989); Yanase and Andoh,
Pesticide Biochem. Physiol. 35: 70 (1989)). These herbicidal compounds include the
diphenylethers {e.g. acifluorfen, 5-[2-chloro-4-(trifluoromethyl)phenoxy]-2-nitrobenzoic
acid; its methyl ester; or oxyfluorfen,
2-chloro-1-(3-ethoxy-4-nitrophenoxy)-4-(trifluoromethyl)benzene}, oxadiazoles (e.g.
oxadiazon, 3-[2,4-dichloro-5-(1-methylethoxy)phenyl]-5-(1,1-dimethylethyl)-1,3,4-oxadiazol-2-(3H)-one),
cyclic imides (e.g. S-23142,
N-(4-chloro-2-fluoro-5-propargyloxyphenyl)-3,4,5,6-tetrahydrophthalimide;
chlorophthalim, N-(4-chlorophenyl)-3,4,5,6-tetrahydrophthalimide), phenyl pyrazoles
(e.g. TNPP-ethyl, ethyl 2-[1-(2,3,4-trichlorophenyl)-4-nitropyrazolyl-5-oxy]propionate;
M&B 39279), pyridine derivatives (e.g. LS 82-556), and phenopylate and its
O-phenylpyrrolidino- and piperidinocarbamate analogs. Many of these compounds
competitively inhibit the
normal reaction catalyzed by the enzyme, apparently acting as substrate analogs.

The predicted mode of action of protox-inhibiting herbicides involves the
accumulation of protoporphyrinogen IX in the chloroplast. This accumulation is thought 
to lead to leakage of protoporphyrinogen IX into the cytosol where it is oxidized by a 
peroxidase activity to protoporphyrin IX. When exposed to light, protoporphyrin IX can 
cause formation of singlet oxygen in the cytosol. This singlet oxygen can in turn lead to 
the formation of other reactive oxygen species, which can cause lipid peroxidation and 
membrane disruption leading to rapid cell death (Lee et al., Plant Physiol. 102: 881
(1993)).

Not all protox enzymes are sensitive to herbicides which inhibit plant protox 
enzymes. Both of the protox enzymes encoded by genes isolated from Escherichia coli 
(Sasarman et al., Can. J. Microbiol. 39: 1155 (1993)) and Bacillus subtilis (Dailey et al.,
J. Biol. Chem. 269: 813 (1994)) are resistant to these herbicidal inhibitors. In addition,
mutants of the unicellular alga Chlamydomonas reinhardtii resistant to the phenylimide
herbicide S-23142 have been reported (Kataoka et al., J. Pesticide Sci. 15:449 (1990);
Shibata et al., In Research in Photosynthesis, Vol. III, N. Murata, ed.
Kluwer: Netherlands, pp. 567-570 (1992)). At least one of these mutants appears to
have an altered protox activity that is resistant not only to the herbicidal inhibitor on
which the mutant was selected, but also to other classes of protox inhibitors (Oshio et
al., Z. Naturforsch. 48c: 339 (1993); Sato et al., In ACS Symposium on Porphyric
Pesticides, S. Duke, ed. ACS Press: Washington, D.C. (1994)). A mutant tobacco cell
line has also been reported that is resistant to the inhibitor S-23142 (Che et al., Z.
Naturforsch. 48c: 350 (1993)).

The present invention provides an isolated DNA molecule encoding the
protoporphyrinogen oxidase (protox) enzyme from a eukaryotic organism, which 
preferably is a higher eukaryotic organism. In particular, the present invention provides 
isolated DNA molecules encoding the protoporphyrinogen oxidase (protox) enzyme 
from a plant or human source. 

Preferred within the scope of the invention are isolated DNA molecules encoding the 
protoporphyrinogen oxidase (protox) enzyme from dicotyledonous plants, but especially 
from Arabidopsis plants, such as those given in SEQ ID NOS: 1, 3, and 9.

Also preferred are isolated DNA molecules encoding the protoporphyrinogen 
oxidase (protox) enzyme from monocotyledonous plants, but especially from maize 
plants, such as those given in SEQ ID NOS: 5 and 7. Especially preferred within the 
invention is an isolated DNA molecule encoding the protoporphyrinogen oxidase 
(protox) enzyme protein from a dicotyledonous plant, wherein said protein comprises 
the amino acid sequence selected from the group consisting of SEQ ID Nos. 2, 4 and
10. Also preferred is an isolated DNA molecule encoding the protoporphyrinogen
oxidase (protox) enzyme protein from a monocotyledonous plant, wherein said protein 
comprises the amino acid sequence selected from the group consisting of SEQ ID Nos. 
6 and 8. 

Using the information provided by the present invention, the DNA coding sequence 
for the protoporphyrinogen oxidase (protox) enzyme from any eukaryotic organism may 
be obtained using standard methods. Thus, in a further embodiment the present 
invention provides probes capable of specifically hybridizing to a eukaryotic DNA 
sequence encoding a protoporphyrinogen oxidase activity or to the respective mRNA,
and methods for detecting the said DNA sequences in eukaryotic organisms using the
probes according to the invention. 

The present invention further embodies expression cassettes and recombinant vectors
comprising said expression cassettes, which comprise essentially a promoter, but
especially a promoter that is active in a plant, operably linked to a DNA molecule
encoding the protoporphyrinogen oxidase (protox) enzyme from a eukaryotic organism
according to the invention. The expression cassette according to the invention may in
addition further comprise a signal sequence operably linked to said DNA molecule, 
wherein said signal sequence is capable of targeting the protein encoded by said DNA 
molecule into the chloroplast or the mitochondria. 

In addition, the present invention provides plants, plant cells, plant tissues and plant 
seeds with altered protox activity which are resistant or at least tolerant to inhibition by 
a herbicide at levels which normally are inhibitory to the naturally occurring protox 
activity in the plant. In particular, the invention embodies plants, wherein the altered 
protox activity is conferred by over-expression of the wild-type protox enzyme or by
expression of a DNA molecule encoding a herbicide tolerant protox enzyme. The said
herbicide tolerant protox enzyme may be a modified form of a protox enzyme that
naturally occurs in a eukaryote or a prokaryote; or a modified form of a protox enzyme
that naturally occurs in said plant; or the said herbicide tolerant protox enzyme may
naturally occur in a prokaryote. Plants encompassed by the invention include
monocotyledonous and dicotyledonous plants, but especially hybrid plants. Preferred
are those plants which would be potential targets for protox inhibiting herbicides, 
particularly agronomically important crops such as maize and other cereal crops such 
as wheat, oats, rye, sorghum, rice, barley, millet, turf and forage grasses, and the like, 
as well as cotton, tobacco, sugar cane, sugar beet, oilseed rape, and soybeans. 

The present invention further encompasses propagating material of a plant according 
to the invention, preferably plant seed, treated with a protectant coating, but especially a 
protectant coating comprising a preparation selected from the group consisting of 
herbicides, insecticides, fungicides, bactericides, nematicides, molluscicides or mixtures
thereof.

The present invention is further directed to methods for the production of plants, 
plant cells, plant tissues, and plant seeds and the transgenic progeny thereof which 
contain a protox enzyme resistant to, or tolerant of, inhibition by a herbicide at a
concentration which inhibits the naturally occurring protox activity. The said resistance
or tolerance may be obtained by expressing in the said transgenic plants either a DNA
molecule encoding a modified form of a protox enzyme that naturally occurs in a
eukaryote, or a modified form of a protox enzyme that naturally occurs in said plant, or
a protox enzyme that naturally occurs in a prokaryote, or a protox enzyme which is a
modified form of a protein which naturally occurs in a prokaryote. 

One specific embodiment of the invention is directed to the preparation of transgenic 
maize plants, maize tissue or maize seed and the transgenic progeny thereof which 
have been stably transformed with a recombinant DNA molecule comprising a suitable 
promoter functional in plants operably linked to a structural gene encoding an 
unmodified prokaryotic protox enzyme which is resistant to the herbicide. 

The invention is further directed to the preparation of transgenic plants, plant cells, 
plant tissue and plant seed and the transgenic progeny thereof which have been stably
transformed with a recombinant DNA molecule comprising a suitable promoter 
functional in plants operably linked to a structural gene encoding an unmodified 
eukaryotic protox enzyme. This results in over-expression of the unmodified protox in 
the plant sufficient to overcome inhibition of the enzyme by the herbicide. 

The present invention also embodies the production of plants which express an 
altered protox enzyme tolerant of inhibition by a herbicide at a concentration which 
normally inhibits the activity of wild-type, unaltered protox. In this embodiment, the 
plant may be stably transformed with a recombinant DNA molecule comprising a 
structural gene encoding the resistant protox, or prepared by direct selection 
techniques whereby herbicide resistant lines are isolated, characterized and developed. 

The present invention is further directed to a method for controlling the growth of
undesired vegetation which comprises applying to a population of a plant with altered
protox activity which is resistant to inhibition by a herbicide at levels which normally are
inhibitory to the naturally occurring protox activity in the said plant, an effective amount
of a protox-inhibiting herbicide. Plants to be protected in the described way are
especially those which would be potential targets for protox inhibiting herbicides,
particularly agronomically important crops such as, for example, maize and other cereal
crops such as wheat, oats, rye, sorghum, rice, barley, millet, turf and forage grasses,
and the like, as well as cotton, sugar cane, sugar beet, oilseed rape, and soybeans.
Herbicides that qualify as protox inhibitors are those selected from the group consisting
of aryluracil, diphenylether, oxadiazole, imide, phenyl pyrazole, pyridine derivative,
phenopylate and O-phenylpyrrolidino- and piperidinocarbamate analogs of said 
phenopylate. 

The present invention also embodies the recombinant production of the protox 
enzyme, and methods for using recombinantly produced protox. The invention thus 
further embodies host cells, but especially cells selected from the group consisting of
plant cells, animal cells, bacterial cells, yeast cells and insect cells, stably transformed
with a recombinant DNA molecule comprising a suitable promoter functional in the
respective host cell operably linked to a structural gene encoding an unmodified or 
modified eukaryotic protox enzyme, wherein said host cell is capable of expressing said 
DNA molecule. 

The present invention further provides methods of using purified protox to screen for 
novel herbicides which affect the activity of protox, and to identify herbicide-resistant 
protox mutants. 

In particular, the invention is directed to a method for assaying a chemical for the 
ability to inhibit the activity of a protox enzyme from a plant comprising 

(a) combining said protox enzyme and protoporphyrinogen IX in a first reaction 
mixture under conditions in which said protox enzyme is capable of catalyzing the 
conversion of said protoporphyrinogen IX to protoporphyrin IX; 

(b) combining said chemical, said protox enzyme and protoporphyrinogen IX in a
second reaction mixture under the same conditions as in said first reaction mixture;

(c) exciting said first and said second reaction mixtures at about 395 to about 410
nm;

(d) comparing the fluorescence of said first and said second reaction mixtures at
about 622 to about 635 nm;

wherein said chemical is capable of inhibiting the activity of said protox enzyme if the
fluorescence of said second reaction mixture is significantly less than the fluorescence
of said first reaction mixture.

In a further embodiment of the invention a method is provided for identifying a
modified protox enzyme resistant to a protox inhibitor present in a population of cells 
comprising the steps of 

(a) culturing said population in the presence of said protox inhibitor in amounts which 
inhibit the unmodified form of said protox enzyme; 

(b) selecting those cells from step (a) whose growth is not inhibited; and 

(c) isolating and identifying the protox enzyme present in the cells selected from step 
(b). 

Genes encoding altered protox can be used as selectable markers in plant cell 
transformation methods. The present invention thus further embodies a method of 
selecting plants, plant tissue or plant cells transformed with a transgene of interest from 
non-transformed plants, comprising the steps of: 

(a) transforming a plant, plant tissue or plant cell with a transgene of interest capable 
of being expressed by the plant, and a gene encoding an altered protox resistant to a 
protox inhibitor; 

(b) transferring the thus-transformed plants or plant cells to a medium comprising the 
protox inhibitor; and 

(c) selecting the plants or plant cells which survive in the medium. 

The present invention is further directed to probes and methods for detecting the 
presence and form of the protox gene and quantitating levels of protox transcripts in 
an organism. These methods may be used to diagnose disease conditions which are
associated with an altered form of the protox enzyme or altered levels of expression of
the protox enzyme. 

In one aspect, the present invention is directed to an isolated DNA molecule which 
encodes a eukaryotic form of protoporphyrinogen oxidase (referred to herein as 
"protox"), the enzyme which catalyzes the oxidation of protoporphyrinogen IX to 
protoporphyrin IX. The DNA coding sequences and corresponding amino acid
sequences for protox enzymes from Arabidopsis thaliana are provided as SEQ ID Nos.
1-4 and 9-10. The DNA coding sequences and corresponding amino acid sequences
for maize protox enzymes are provided as SEQ ID Nos. 5-8.

Any desired eukaryotic DNA encoding the protox enzyme may be isolated according 
to the invention. One method taught for isolating a eukaryotic protox coding sequence 
is represented by Example 1. In this method cDNA clones encoding a protox enzyme 
are identified from a library of cDNA clones derived from the eukaryote of interest 
based on their ability to supply protox enzymatic activity to a mutant host organism 
deficient in this activity. Suitable host organisms for use in this method are those which 
can be used to screen cDNA expression libraries and for which mutants deficient in 
protox activity are either available or can be routinely generated. Such host organisms 
include, but are not limited to, E. coli (Sasarman et al., J. Gen. Microbiol. 113:297
(1979)), Salmonella typhimurium (Xu et al., J. Bacteriol. 174: 3953 (1992)), and
Saccharomyces cerevisiae (Camadro et al., Biochem. Biophys. Res. Comm. 106:724
(1982)). 

Alternatively, eukaryotic protox coding sequences may be isolated according to well 
known techniques based on their sequence homology to the Arabidopsis thaliana (SEQ 
ID Nos. 1, 3 and 9) and Zea mays (SEQ ID Nos. 5 and 7) protox coding sequences
taught by the present invention. In these techniques all or part of the known protox
coding sequence is used as a probe which selectively hybridizes to other protox coding
sequences present in a population of cloned genomic DNA fragments or cDNA fragments
(i.e. genomic or cDNA libraries) from a chosen organism. Such techniques include
hybridization screening of plated DNA libraries (either plaques or colonies; see, e.g.,
Sambrook et al., Molecular Cloning, eds., Cold Spring Harbor Laboratory Press
(1989)) and amplification by PCR using oligonucleotide primers corresponding to
sequence domains conserved among known protox amino acid sequences (see, e.g.,
Innis et al., PCR Protocols, a Guide to Methods and Applications, eds., Academic
Press (1990)). These methods are particularly well suited to the isolation of protox 
coding sequences from organisms related to the organism from which the probe 
sequence is derived. For example, application of these methods using the Arabidopsis 
or Zea mays coding sequence as a probe would be expected to be particularly well 
suited for the isolation of protox coding sequences from other plant species.

The isolated eukaryotic protox sequences taught by the present invention may be
manipulated according to standard genetic engineering techniques to suit any desired 
purpose. For example, the entire protox sequence or portions thereof may be used as 
probes capable of specifically hybridizing to protox coding sequences and messenger 
RNAs. To achieve specific hybridization under a variety of conditions, such probes 
include sequences that are unique among protox coding sequences and are preferably 
at least 10 nucleotides in length, and most preferably at least 20 nucleotides in length. 
Such probes may be used to amplify and analyze protox coding sequences from a 
chosen organism via the well known process of polymerase chain reaction (PCR). 
This technique may be useful to isolate additional protox coding sequences from a 
desired organism or as a diagnostic assay to determine the presence of protox coding 
sequences in an organism and to associate altered coding sequences with particular 
adverse conditions such as variegate porphyria, an autosomal dominant disorder in
humans characterized by both neuropsychiatric symptoms and skin lesions, in which
patients have decreased levels of protox activity (Brenner and Bloomer, New Engl. J.
Med. 302: 765 (1980)).
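
As a rough illustration of the probe criteria described above, the following Python sketch lists fixed-length windows of a target coding sequence that do not occur in any of a set of comparison sequences. The function name, the default window of 20 nucleotides (taken from the most preferred minimum length stated above) and the toy sequences are assumptions made for illustration only. Exact string matching is a simplification: it does not model hybridization stringency, mismatches, or the complementary strand.

```python
def candidate_probes(target, comparison_seqs, length=20):
    """Return (offset, window) pairs from target that are absent from all comparison sequences."""
    target = target.upper()
    comparison_seqs = [seq.upper() for seq in comparison_seqs]
    probes = []
    for offset in range(len(target) - length + 1):
        window = target[offset:offset + length]
        # Keep the window only if no comparison sequence contains it exactly.
        if not any(window in seq for seq in comparison_seqs):
            probes.append((offset, window))
    return probes


# Hypothetical toy sequences, not taken from any SEQ ID listing.
print(candidate_probes("AAAACCCC", ["AAAAC"], length=4))
```

In practice the comparison set would hold whatever sequences the probe must not cross-hybridize with, and surviving windows would still need to be checked experimentally under the chosen hybridization conditions.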

Protox specific hybridization probes may also be used to map the location of the 
native eukaryotic protox gene(s) in the genome of a chosen organism using standard 
techniques based on the selective hybridization of the probe to genomic protox 
sequences. These techniques include, but are not limited to, identification of DNA 
polymorphisms identified or contained within the protox probe sequence, and use of 
such polymorphisms to follow segregation of the protox gene relative to other markers 
of known map position in a mapping population derived from self fertilization of a hybrid 
of two polymorphic parental lines (see, e.g., Helentjaris et al., Plant Mol. Biol. 5: 109
(1985); Sommer et al., Biotechniques 12: 82 (1992); D'Ovidio et al., Plant Mol. Biol. 15:
169 (1990)). While any eukaryotic protox sequence is contemplated to be useful as a
probe for mapping protox genes from any eukaryotic organism, preferred probes are 
those protox sequences from organisms more closely related to the chosen organism, 
and most preferred probes are those protox sequences from the chosen organism. 
Mapping of protox genes in this manner is contemplated to be particularly useful in 
plants for breeding purposes. For instance, by knowing the genetic map position of a 
mutant protox gene that confers herbicide resistance, flanking DNA markers can be
identified from a reference genetic map (see, e.g., Helentjaris, Trends Genet. 3:217
(1987)). During introgression of the herbicide resistance trait into a new breeding line, 
these markers can then be used to monitor the extent of protox-linked flanking 
chromosomal DNA still present in the recurrent parent after each round of back- 
crossing. 

Protox specific hybridization probes may also be used to quantitate levels of protox 
mRNA in an organism using standard techniques such as Northern blot analysis. This 
technique may be useful as a diagnostic assay to detect altered levels of protox 
expression that may be associated with particular adverse conditions such as
variegate porphyria, an autosomal dominant disorder in humans characterized by both
neuropsychiatric symptoms and skin lesions, in which patients have decreased levels
of protox activity (Brenner and Bloomer, New Engl. J. Med. 302: 765 (1980)).

For recombinant production of the enzyme in a host organism, the protox coding 
sequence may be inserted into an expression cassette designed for the chosen host 
and introduced into the host where it is recombinantly produced. The choice of specific 
regulatory sequences such as promoter, signal sequence, 5' and 3' untranslated
sequences, and enhancer, is within the level of skill of the routineer in the art. The 
resultant molecule, containing the individual elements linked in proper reading frame, 
may be inserted into a vector capable of being transformed into the host cell. Suitable 
expression vectors and methods for recombinant production of proteins are well known 
for host organisms such as E. coli (see, e.g. Studier and Moffatt, J. Mol. Biol. 189: 113 
(1986); Brosius, DNA 8: 759 (1989)), yeast (see, e.g., Schneider and Guarente, Meth. 
Enzymol. 194: 373 (1991)) and insect cells (see, e.g., Luckow and Summers, 
Bio/Technol. 6:47 (1988)). Specific examples include plasmids such as pBluescript
(Stratagene, La Jolla, CA), pFLAG (International Biotechnologies, Inc., New Haven,
CT), pTrcHis (Invitrogen, La Jolla, CA), and baculovirus expression vectors, e.g., those
derived from the genome of Autographa californica nuclear polyhedrosis virus
(AcMNPV). A preferred baculovirus/insect system is pVI1 1392/Sf21 cells (Invitrogen, 
La Jolla, CA). 

WO 95/34659

PCT/IB95/00452

Recombinantly produced eukaryotic protox enzyme is useful for a variety of
purposes. For example, it may be used to supply protox enzymatic activity in vitro. It
may also be used in an in vitro assay to screen known herbicidal chemicals whose
target has not been identified to determine if they inhibit protox. Such an in vitro assay 
may also be used as a more general screen to identify chemicals which inhibit protox 
activity and which are therefore herbicide candidates. Alternatively, recombinantly 
produced protox enzyme may be used to further characterize its association with known 
inhibitors in order to rationally design new inhibitory herbicides as well as herbicide 
tolerant forms of the enzyme. 

Typically, the inhibitory effect on protox is determined by measuring fluorescence at
about 622 to 635 nm, after excitation at about 395 to 410 nm (see, e.g. Jacobs and
Jacobs, Enzyme 28: 206 (1982); Sherman et al., Plant Physiol. 97:280 (1991)). This
assay is based on the fact that protoporphyrin IX is a fluorescent pigment, and
protoporphyrinogen IX is nonfluorescent. Protein extracts are prepared from selected
subcellular fractions, e.g. etioplasts, mitochondria, microsomes, or plasma membrane,
by differential centrifugation (see, e.g. Lee et al., Plant Physiol. 102:881 (1993); Prado
et al., Plant Physiol. 65:956 (1979); Jackson and Moore, in Plant Organelles, Reid, ed.,
pp. 1-12; Jacobs and Jacobs, Plant Physiol. 101:1181 (1993)). Protoporphyrinogen is
prepared by reduction of protoporphyrin with a sodium amalgam as described by
Jacobs and Jacobs (1982). Reaction mixtures typically consist of 100 mM Hepes (pH
7.5), 5 mM EDTA, 2 mM DTT, about 2 uM protoporphyrinogen IX, and about 1 mg/mL
protein extract. Inhibitor solutions in various concentrations, e.g. 1 mM, 100 uM, 10 uM,
1 uM, 100 nM, 10 nM, 1 nM, 100 pM, are added to the enzyme extract prior to the
initiation of the enzyme reaction. Once the protein extract is added, fluorescence is
monitored for several minutes, and the slope (reaction rate) is calculated
from a region of linearity. IC50 is determined by comparing the slope of the inhibited
reaction to a control reaction.
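
The rate and IC50 calculation described above may be illustrated by the following minimal sketch. It is offered only as an example under stated assumptions and is not the procedure of the cited references: the function names reaction_rate and estimate_ic50 are hypothetical, the slope is taken by ordinary least squares over points already chosen from the region of linearity, and the IC50 is estimated by interpolating on a log10 concentration scale between the two inhibitor concentrations that bracket 50% of the control rate.

```python
from math import log10


def reaction_rate(times, fluorescence):
    """Least-squares slope of fluorescence versus time.

    The caller is assumed to pass only points from the linear region.
    """
    n = len(times)
    if n < 2 or n != len(fluorescence):
        raise ValueError("need at least two paired time/fluorescence points")
    mean_t = sum(times) / n
    mean_f = sum(fluorescence) / n
    sxx = sum((t - mean_t) ** 2 for t in times)
    if sxx == 0:
        raise ValueError("time points must not all be identical")
    sxy = sum((t - mean_t) * (f - mean_f) for t, f in zip(times, fluorescence))
    return sxy / sxx


def estimate_ic50(concentrations, rates, control_rate):
    """Concentration at which the rate falls to half the control rate.

    Interpolates linearly in log10(concentration) between the two tested
    concentrations that bracket 50% of control (an illustrative choice).
    Returns None when no tested pair brackets the half-maximal rate.
    """
    half = 0.5 * control_rate
    pairs = sorted(zip(concentrations, rates))
    for (c_lo, r_lo), (c_hi, r_hi) in zip(pairs, pairs[1:]):
        if r_lo >= half >= r_hi and r_lo != r_hi:
            frac = (r_lo - half) / (r_lo - r_hi)
            return 10 ** (log10(c_lo) + frac * (log10(c_hi) - log10(c_lo)))
    return None
```

In use, one slope would be computed for the uninhibited control and one for each inhibitor concentration, and the resulting slopes passed to estimate_ic50 together with the concentrations in any consistent unit.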

Another embodiment of the present invention involves the use of protox in an assay 
to identify inhibitor-resistant protox mutants. A typical assay is as follows: 

(a) incubating a first sample of protox and its substrate, protoporphyrinogen IX, in 
the presence of a second sample comprising a protox inhibitor; 

(b) measuring the enzymatic activity of the protox from step (a);

(c) incubating a first sample of a mutated protox and its substrate in the presence of
a second sample comprising the same protox inhibitor;

(d) measuring the enzymatic activity of the mutated protox from step (c); and 

(e) comparing the enzymatic activity of the mutated protox with that provided by the 
unmutated protox. 

The reaction mixture and the reaction conditions are the same as for the assay to 
identify inhibitors of protox (inhibitor assay) with the following modifications. First, a 
protox mutant, obtained as described above, is substituted in one of the reaction 
mixtures for the wild-type protox of the inhibitor assay. Second, an inhibitor of wild-type 
protox is present in both reaction mixtures. Third, mutated activity (enzyme activity in
the presence of inhibitor and mutated protox) and unmutated activity (enzyme activity in 
the presence of inhibitor and wild-type protox) are compared to determine whether a 
significant increase in enzyme activity is observed in the mutated activity when 
compared to the unmutated activity. Mutated activity is any measure of enzymatic 
activity of the mutated protox enzyme while in the presence of a suitable substrate and 
the inhibitor. Unmutated activity is any measure of enzymatic activity of the wild-type 
protox enzyme while in the presence of a suitable substrate and the inhibitor. A 
significant increase is defined as an increase in enzymatic activity that is larger than the 
margin of error inherent in the measurement technique, preferably an increase by about 
2-fold of the activity of the wild-type enzyme in the presence of the inhibitor, more 
preferably an increase by about 5-fold, most preferably an increase by greater than
about 10-fold.
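
The comparison of mutated and unmutated activity may be sketched as follows. This is an illustration only: the function names and the returned labels are hypothetical, the margin of error is assumed to be supplied by the experimenter in the same units as the activities, and the "about" 2-fold, 5-fold and 10-fold levels of the text are treated here as exact cut-offs for simplicity.

```python
def fold_increase(mutated_activity, unmutated_activity):
    """Ratio of mutated to unmutated activity, both measured with inhibitor."""
    if unmutated_activity <= 0:
        raise ValueError("unmutated activity must be positive")
    return mutated_activity / unmutated_activity


def classify_increase(mutated_activity, unmutated_activity, margin_of_error):
    """Label an increase using the tiers described in the text.

    An increase no larger than the measurement margin of error is not
    treated as significant.
    """
    if mutated_activity - unmutated_activity <= margin_of_error:
        return "not significant"
    fold = fold_increase(mutated_activity, unmutated_activity)
    if fold > 10:
        return "most preferred"
    if fold >= 5:
        return "more preferred"
    if fold >= 2:
        return "preferred"
    return "significant"
```

For example, a mutant showing 6 activity units against 1 unit for the wild-type enzyme, with a margin of error of 0.2 units, would fall in the "more preferred" tier.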

The herbicides that inhibit protox include many different structural classes of
molecules (Duke et al., Weed Sci. 39: 465 (1991); Nandihalli et al., Pesticide Biochem.
Physiol. 43: 193 (1992); Matringe et al., FEBS Lett. 245: 35 (1989); Yanase and Andoh,
Pesticide Biochem. Physiol. 35: 70 (1989)), including the diphenylethers {e.g.
acifluorfen, 5-[2-chloro-4-(trifluoromethyl)phenoxy]-2-nitrobenzoic acid; its methyl ester;
or oxyfluorfen, 2-chloro-1-(3-ethoxy-4-nitrophenoxy)-4-(trifluorobenzene)}, oxidiazoles
(e.g. oxidiazon, 3-[2,4-dichloro-5-(1-methylethoxy)phenyl]-5-(1,1-dimethylethyl)-1,3,4-
oxadiazol-2-(3H)-one), cyclic imides (e.g. S-23142, N-(4-chloro-2-fluoro-5-
propargyloxyphenyl)-3,4,5,6-tetrahydrophthalimide; chlorophthalim, N-(4-chlorophenyl)-
3,4,5,6-tetrahydrophthalimide), phenyl pyrazoles (e.g. TNPP-ethyl, ethyl 2-[1-(2,3,4-
trichlorophenyl)-4-nitropyrazolyl-5-oxy]propionate; M&B 39279), pyridine derivatives
(e.g. LS 82-556), and phenopylate and its O-phenylpyrrolidino- and
piperidinocarbamate analogs.

The diphenylethers of particular significance are those having the general formula 




(Formula I) 



wherein R equals -COONa (Formula II), -CONHSO2CH3 (Formula III) or
-COOCH2COOC2H5 (Formula IV; see Maigrot et al., Brighton Crop Protection
Conference-Weeds: 47-51 (1989)).



Additional diphenylethers of interest are those where R equals: 

NOCH2COOCH3
CCH2OCH3

(Formula IVa; see Hayashi et al., Brighton Crop Protection
Conference-Weeds: 53-58 (1989)).

An additional diphenylether of interest is one having the formula: 

(Formula IVb; bifenox, see Dest et al., Proc. Northeast Weed Sci.
Conf. 27: 31 (1973)).



Also of significance are the class of herbicides known as imides, having the general 
formula

(Formula V)

wherein Q equals

(Formula VI)

(Formula VII)

(Formula VIII)

(Formula IX)

(Formula IXa)

(Formula IXb)

(see Hemper et al. (1995) in "Proceedings of the Eighth International Congress of
Pesticide Chemistry", Ragdale et al., eds., Amer. Chem. Soc., Washington, D.C.,
pp. 42-48 (1994)).



and R1 equals H, Cl or F, R2 equals Cl and R3 is an optionally substituted ether,
thioether, ester, amino or alkyl group. Alternatively, R2 and R3 together may form a 5 or
6 membered heterocyclic ring. Examples of imide herbicides of particular interest are




(Formula X)

(Formula XI)

(Formula XII)
(see Miura et al., Brighton Crop Protection
Conference-Weeds: 35-40 (1993))

(Formula XIII)

(Formula XIV)

(Formula XV)

(Formula XVI)



The herbicidal activity of the above compounds is described in the Proceedings of 
the 1991 Brighton Crop Protection Conference, Weeds (British Crop Protection Council) 
(Formulae X and XVI), Proceedings of the 1993 Brighton Crop Protection Conference, 
Weeds (British Crop Protection Council) (Formulae XII and XIII), U.S. Patent No. 
4,746,352 (Formula XI) and Abstracts of the Weed Science Society of America vol. 33, 
pg. 9 (1993)(Formula XIV). 

The most preferred imide herbicides are those classified as aryluracils and having 
the general formula 



wherein R signifies the group (C2-6-alkenyloxy)carbonyl-C1-4-alkyl, as disclosed in U.S.
Patent No. 5,183,492, herein incorporated by reference.

Also of significance are herbicides having the general formula: 




(Formula XVII)

(Formula XVIII; thiadiazimin)
(see Weiler et al., Brighton Crop Protection
Conference-Weeds, pp. 29-34 (1993));



(Formula XIX; carfentrazone)
(see Van Saun et al., Brighton Crop
Protection Conference-Weeds: pp. 19-22 (1993));



N-substituted pyrazoles of the general formula: 




(Formula XX) 
(see international patent publications WO 94/08999, 
WO 93/10100, and U. S. Patent No. 5,405,829 assigned to 
Schering); 



N-phenylpyrazoles, such as:

(Formula XXI; nipyraclofen)
(see page 621 of "The Pesticide Manual", 9th ed., ed. by C.R.
Worthing, British Crop Protection Council, Surrey (1991));

and 3-substituted-2-aryl-4,5,6,7-tetrahydroindazoles (Lyga et al., Pesticide Sci. 42:29-36
(1994)).



Levels of herbicide which normally are inhibitory to the activity of protox include 
application rates known in the art, and which depend partly on external factors such as 
environment, time and method of application. For example, in the case of the imide 
herbicides represented by Formulae V through IX, and more particularly those 
represented by Formulae X through XVII, the application rates range from 0.0001 to 10 
kg/ha, preferably from 0.005 to 2 kg/ha. This dosage rate or concentration of herbicide 
may be different, depending on the desired action and particular compound used, and 
can be determined by methods known in the art. 

The present invention is further directed to plants, plant tissue and plant seeds 
tolerant to herbicides that inhibit the naturally occurring protox activity in these plants,
wherein the tolerance is conferred by an altered protox enzyme activity. Representative
plants include any plants to which these herbicides are applied for their normally
intended purpose. Preferred are agronomically important crops, i.e., angiosperms and
gymnosperms such as cotton, soya, rape, sugar beet, maize, rice, wheat, barley,
oats, rye, sorghum, millet, turf, forage, turf grasses and the like. 

By "altered protox enzyme activity" is meant a protox enzymatic activity different 
from that which naturally occurs in a plant (i.e. protox activity which occurs naturally in 
the absence of direct or indirect manipulation of such activity by man) which is resistant
to herbicides that inhibit the naturally occurring activity. Altered protox enzyme activity
may be conferred upon a plant according to the invention by increasing expression of 
wild-type, herbicide-sensitive protox, expressing an altered, herbicide-tolerant 
eukaryotic protox enzyme in the plant, expressing an unmodified or modified bacterial 
form of the protox enzyme which is herbicide resistant in the plant, or by a combination 
of these techniques. 

Achieving altered protox enzyme activity through increased expression results in a 
level of protox in the plant cell at least sufficient to overcome growth inhibition caused 
by the herbicide. The level of expressed protox generally is at least two times, 
preferably five times, and more preferably at least ten times the natively expressed 
amount. Increased expression may be due to multiple copies of a wild-type protox 
gene; multiple occurrences of the protox coding sequence within the protox gene (i.e. 
gene amplification) or a mutation in the non-coding, regulatory sequence of the 
endogenous protox gene in the plant cell. Plants containing such altered protox 
enzyme activity can be obtained by direct selection in plants. This method is known in 
the art. See, e.g. Somers et al. in U.S. 5,162,602, and Anderson et al. in U.S.
4,761,373, and references cited therein. These plants also may be obtained via 
genetic engineering techniques known in the art. 

Increased expression of herbicide-sensitive protox also can be accomplished by 
stably transforming a plant cell with a recombinant or chimeric DNA molecule 
comprising a promoter capable of driving expression of an associated structural gene in 
a plant cell, linked to a homologous or heterologous structural gene encoding protox. 
By "homologous," it is meant that the protox gene is isolated from an organism 
taxonomically identical to the target plant cell. By "heterologous," it is meant that the
protox gene is obtained from an organism taxonomically distinct from the target plant
cell. Homologous protox genes can be obtained by complementing a bacterial or yeast
auxotrophic mutant with a cDNA expression library from the target plant. See, e.g.
Example 1 and Snustad et al., Genetics 120:1111-1114 (1988) (maize glutamine
synthase); Delauney et al., Mol. Gen. Genet. 221:299-305 (1990) (soybean
pyrroline-5-carboxylate reductase); Frisch et al., Mol. Gen. Genet. 223:287-293 (1991) (maize
dihydrodipicolinate synthase); Eller et al., Plant Mol. Biol. 18:557-566 (1992) (rape
chloroplast 3-isopropylmalate dehydrogenase); Proc. Natl. Acad. Sci. USA
85:1731-1735 (1991); Minet et al., Plant J. 2:417-422 (1992) (dihydroorotate
dehydrogenase) and references cited therein. Other known methods include screening 
genomic or cDNA libraries of higher plants, for example, for sequences that 
cross-hybridize with specific nucleic acid probes, or by screening expression libraries 
for the production of protox enzymes that cross-react with specific antibody probes. A 
preferred method involves complementing an E. coli hemG auxotrophic mutant with a
maize or Arabidopsis thaliana cDNA library. 

Examples of promoters capable of functioning in plants or plant cells, i.e., those 
capable of driving expression of the associated structural genes such as protox in plant 
cells, include the cauliflower mosaic virus (CaMV) 19S or 35S promoters and CaMV 
double promoters; nopaline synthase promoters; pathogenesis-related (PR) protein 
promoters; small subunit of ribulose bisphosphate carboxylase (ssuRUBISCO) 
promoters, and the like. Preferred are the rice actin promoter (McElroy et al., Mol. Gen.
Genet. 231: 150 (1991)), maize ubiquitin promoter (EP 0 342 926; Taylor et al., Plant
Cell Rep. 12:491 (1993)), and the Pr-1 promoter from tobacco, Arabidopsis, or maize
(see International Patent Application No. PCT/IB95/00002 to Ryals et al., incorporated
by reference herein in its entirety). Also preferred are the 35S promoter and an
enhanced or double 35S promoter such as that described in Kay et al., Science 236:
1299-1302 (1987) and the double 35S promoter cloned into pCGN2113, deposited as 
ATCC 40587, which are disclosed in EP-A 0 392 225, the relevant disclosures of which 
are herein incorporated by reference in their entirety. The promoters themselves may 
be modified to manipulate promoter strength to increase protox expression, in 
accordance with art-recognized procedures. 

Signal or transit peptides may be fused to the protox coding sequence in the 
chimeric DNA constructs of the invention to direct transport of the expressed protox 
enzyme to the desired site of action. Examples of signal peptides include those 
natively linked to the plant pathogenesis-related proteins, e.g. PR-1 , PR-2, and the like. 
See, e.g., Payne et al., Plant Mol. Biol. 11:89-94 (1988). Examples of transit peptides
include the chloroplast transit peptides such as those described in Von Heijne et al.,
Plant Mol. Biol. Rep. 9:104-126 (1991); Mazur et al., Plant Physiol. 85: 1110 (1987);
Vorst et al., Gene 65: 59 (1988), and mitochondrial transit peptides such as those
described in Boutry et al., Nature 328:340-342 (1987). Chloroplast and mitochondrial
transit peptides are contemplated to be particularly useful with the present invention as
protox enzymatic activity typically occurs within the mitochondria and chloroplast. Most
preferred for use are chloroplast transit peptides as inhibition of the protox enzymatic
activity in the chloroplasts is contemplated to be the primary basis for the action of
protox-inhibiting herbicides (Witkowski and Halling, Plant Physiol. 87:632 (1988);
Lehnen et al., Pestic. Biochem. Physiol. 37: 239 (1990); Duke et al., Weed Sci. 39: 465
(1991)). Also included are sequences that result in localization of the encoded protein
to various cellular compartments such as the vacuole. See, for example, Neuhaus et
al., Proc. Natl. Acad. Sci. USA 88: 10362-10366 (1991) and Chrispeels, Ann. Rev.
Plant Physiol. Plant Mol. Biol. 42: 21-53 (1991). The relevant disclosures of these
publications are incorporated herein by reference in their entirety. 

The chimeric DNA construct(s) of the invention may contain multiple copies of a 
promoter or multiple copies of the protox structural genes. In addition, the construct(s) 
may include coding sequences for markers and coding sequences for other peptides 
such as signal or transit peptides, each in proper reading frame with the other 
functional elements in the DNA molecule. The preparation of such constructs is
within the ordinary level of skill in the art. 

Useful markers include peptides providing herbicide, antibiotic or drug resistance, 
such as, for example, resistance to hygromycin, kanamycin, G418, gentamycin, 
lincomycin, methotrexate, glyphosate, phosphinothricin, or the like. These markers 
can be used to select cells transformed with the chimeric DNA constructs of the 
invention from untransformed cells. Other useful markers are peptidic enzymes which
can be easily detected by a visible reaction, for example a color reaction, such as
luciferase, β-glucuronidase, or β-galactosidase.

Altered protox enzyme activity may also be achieved through the generation or 
identification of modified forms of the isolated eukaryotic protox coding sequence 
having at least one amino acid substitution, addition or deletion which encode an 
altered protox enzyme resistant to a herbicide that inhibits the unaltered, naturally 
occurring form (i.e. forms which occur naturally in a eukaryotic organism without being
manipulated, either directly via recombinant DNA methodology or indirectly via selective
breeding, etc., by man). Genes encoding such enzymes can be obtained by numerous
strategies known in the art. A first general strategy involves direct or indirect
mutagenesis procedures on microbes. For instance, a genetically manipulable microbe,
e.g. E. coli or S. cerevisiae, may be subjected to random mutagenesis in vivo, with, for
example UV light or ethyl or methyl methane sulfonate. Mutagenesis procedures are
described, for example in Miller, Experiments in Molecular Genetics, Cold Spring
Harbor Laboratory, Cold Spring Harbor, NY (1972); Davis et al., Advanced Bacterial
Genetics, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1980); Sherman et
al., Methods in Yeast Genetics, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY
(1983); and U.S. Patent No. 4,975,374 (Goodman et al.). The microbe selected for
mutagenesis contains a normally herbicide sensitive eukaryotic protox gene and is 
dependent upon the protox activity conferred by this gene. The mutagenized cells are 
grown in the presence of the herbicide at concentrations which inhibit the unmodified 
protox enzyme. Colonies of the mutagenized microbe that grow better than the 
unmutagenized microbe in the presence of the inhibitor (i.e. exhibit resistance to the 
inhibitor) are selected for further analysis. The protox genes from these colonies are 
isolated, either by cloning or by polymerase chain reaction amplification, and their 
sequences elucidated. Sequences encoding an altered protox enzyme are then cloned 
back into the microbe to confirm their ability to confer inhibitor resistance. 

A second method of obtaining mutant herbicide-resistant alleles of the eukaryotic 
protox enzyme involves direct selection in plants. For example, the effect of a protox 
inhibitor such as those described above, on the growth inhibition of plants such as
Arabidopsis, soybean, or maize may be determined by plating seeds sterilized by
art-recognized methods on plates on a simple minimal salts medium containing
increasing concentrations of the inhibitor. Such concentrations are in the range of 
0.001, 0.003, 0.01, 0.03, 0.1, 0.3, 1, 3, 10, 30, 100, 300, 1000 and 3000 parts per
million (ppm). The lowest dose at which significant growth inhibition can be 
reproducibly detected is used for subsequent experiments. 
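
Selection of the working dose from such a concentration series may be sketched as follows. The sketch is illustrative only: the function name lowest_inhibitory_dose, the representation of growth as a fraction of the untreated control, and the default 25% inhibition threshold are assumptions introduced here, since the text does not fix a numerical criterion for "significant" inhibition. Reproducibility is modeled by requiring every replicate at a dose to meet the threshold.

```python
def lowest_inhibitory_dose(growth_by_dose, min_inhibition=0.25):
    """Lowest dose at which every replicate shows the required inhibition.

    growth_by_dose maps a dose (ppm) to a list of replicate growth values,
    each expressed as a fraction of the untreated control (1.0 = no effect).
    min_inhibition is a hypothetical threshold chosen for illustration.
    Returns None if no tested dose qualifies.
    """
    for dose in sorted(growth_by_dose):
        replicates = growth_by_dose[dose]
        if replicates and all(1.0 - g >= min_inhibition for g in replicates):
            return dose
    return None
```

A dose at which only one of two replicate plates shows inhibition is passed over in favor of the next higher dose at which both do.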

Mutagenesis of plant material may be utilized to increase the frequency at which 
resistant alleles occur in the selected population. Mutagenized seed material can be
derived from a variety of sources, including chemical or physical mutagenesis of seeds,
or chemical or physical mutagenesis of pollen (Neuffer, In Maize for Biological
Research, Sheridan, ed., Univ. Press, Grand Forks, ND, pp. 61-64 (1982)), which is
then used to fertilize plants and the resulting M1 mutant seeds collected. Typically, for
Arabidopsis, M2 seeds (Lehle Seeds, Tucson, AZ), i.e. progeny seeds of plants grown
from seeds mutagenized with chemicals, such as ethyl methane sulfonate, or with
physical agents, such as gamma rays or fast neutrons, are plated at densities of up to
10,000 seeds/plate (10 cm diameter) on minimal salts medium containing an
appropriate concentration of inhibitor to select for resistance. Seedlings that continue 
to grow and remain green 7-21 days after plating are transplanted to soil and grown to 
maturity and seed set. Progeny of these seeds are tested for resistance to the 
herbicide. If the resistance trait is dominant, plants whose seed segregate 
3:1::resistant:sensitive are presumed to have been heterozygous for the resistance at
the M2 generation. Plants that give rise to all resistant seed are presumed to have 
been homozygous for the resistance at the M2 generation. Such mutagenesis on intact 
seeds and screening of their M2 progeny seed can also be carried out on other 
species, for instance soybean (see, e.g. U.S. Pat. No. 5,084,082 (Sebastian)). Mutant
seeds to be screened for herbicide tolerance can also be obtained as a result of 
fertilization with pollen mutagenized by chemical or physical means. 
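
The segregation reasoning above, for a dominant resistance trait, may be illustrated with a chi-square goodness-of-fit check against the expected 3:1 ratio. The text does not prescribe a statistical test; the chi-square test, the 5% significance level, the function names and the returned labels are assumptions made for this sketch. The value 3.841 is the standard 5% critical value of the chi-square distribution with one degree of freedom.

```python
CRITICAL_5PCT_1DF = 3.841  # chi-square critical value, 1 degree of freedom, p = 0.05


def chi_square_3_to_1(resistant, sensitive):
    """Chi-square statistic for observed counts against a 3:1 expectation."""
    total = resistant + sensitive
    if total <= 0:
        raise ValueError("at least one progeny seed must be scored")
    expected_r = 0.75 * total
    expected_s = 0.25 * total
    return ((resistant - expected_r) ** 2 / expected_r
            + (sensitive - expected_s) ** 2 / expected_s)


def infer_m2_genotype(resistant, sensitive):
    """Presumed M2 genotype of the parent plant for a dominant trait.

    All-resistant progeny suggest a homozygous parent; progeny consistent
    with 3:1 suggest a heterozygous parent. This simplification ignores
    sample size when no sensitive progeny are seen.
    """
    if resistant > 0 and sensitive == 0:
        return "homozygous"
    if chi_square_3_to_1(resistant, sensitive) < CRITICAL_5PCT_1DF:
        return "heterozygous"
    return "inconclusive"
```

With small progeny samples an all-resistant result is weak evidence of homozygosity, so in practice a minimum number of scored seeds would be required before drawing that conclusion.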

Two approaches can be taken to confirm that the genetic basis of the resistance is
an altered protox gene. First, alleles of the protox gene from plants exhibiting 
resistance to the inhibitor can be isolated using PCR with primers based either upon 
conserved regions in the Arabidopsis and maize protox cDNA sequences shown in
SEQ ID NOS:1, 3, 5, 7 below or, more preferably, based upon the unaltered protox gene
sequences from the plant used to generate resistant alleles. After sequencing the 
alleles to determine the presence of mutations in the coding sequence, the alleles can 
be tested for their ability to confer resistance to the inhibitor on plants into which the 
putative resistance-conferring alleles have been transformed. These plants can be 
either Arabidopsis plants or any other plant whose growth is susceptible to the 
inhibitors. Second, the protox genes can be mapped relative to known restriction 
fragment length polymorphisms (RFLPs) (see, for example, Chang et al., Proc. Natl.
Acad. Sci. USA 85:6856-6860 (1988); Nam et al., Plant Cell 1:699-705 (1989)). The
resistance trait can be independently mapped using the same markers. If resistance is 
due to a mutation in that protox gene, the resistance trait will map to a position 
indistinguishable from the position of a protox gene. 

A third method of obtaining herbicide-resistant alleles of protox is by selection in 
plant cell cultures. Explants of plant tissue, e.g. embryos, leaf disks, etc. or actively 
growing callus or suspension cultures of a plant of interest are grown on defined 
medium lacking heme in the presence of increasing concentrations of the inhibitory 
herbicide or an analogous inhibitor suitable for use in a laboratory environment. 
Varying degrees of growth are recorded in different cultures. In certain cultures, 
fast-growing variant colonies arise that continue to grow even in the presence of 
normally inhibitory concentrations of inhibitor. The frequency with which such faster- 
growing variants occur can be increased by treatment with a chemical or physical 
mutagen before exposing the tissues or cells to the herbicide. Putative 
resistance-conferring alleles of the protox gene are isolated and tested as described in 
the foregoing paragraphs. Those alleles identified as conferring herbicide resistance 
may then be engineered for optimal expression and transformed into the plant. 
Alternatively, plants can be regenerated from the tissue or cell cultures containing these 
alleles. 

A fourth method involves mutagenesis of wild-type, herbicide sensitive protox genes 
in bacteria or yeast, followed by culturing the microbe on medium that lacks heme, but 
which contains inhibitory concentrations of the inhibitor and then selecting those 
colonies that grow in the presence of the inhibitor. More specifically, a plant cDNA,
such as the Arabidopsis or maize cDNA encoding protox is cloned into a microbe that
otherwise lacks protox activity. Examples of such microbes include E. coli, S.
typhimurium, and S. cerevisiae auxotrophic mutants, including E. coli strain SASX38
(Sasarman et al., J. Gen. Microbiol. 113:297 (1979)), S. typhimurium strain TE2483 or
TT13680 (Xu et al., J. Bacteriol. 174:3953 (1992)), and the hem14-1 yeast mutant
(Camadro et al., Biochem. Biophys. Res. Comm. 106:724 (1982)). The transformed
microbe is then subjected to in vivo mutagenesis such as described immediately above,
or to in vitro mutagenesis by any of several chemical or enzymatic methods known in
the art, e.g. sodium bisulfite (Shortle et al., Methods Enzymol. 100:457-468 (1983));
methoxylamine (Kadonaga et al., Nucleic Acids Res. 13:1733-1745 (1985));
oligonucleotide-directed saturation mutagenesis (Hutchinson et al., Proc. Natl. Acad.
Sci. USA 83:710-714 (1986)); or various polymerase misincorporation strategies (see,
e.g. Shortle et al., Proc. Natl. Acad. Sci. USA 79:1588-1592 (1982); Shiraishi et al.,
Gene 64:313-319 (1988); and Leung et al., Technique 1:11-15 (1989)). Colonies that
grow in the presence of normally inhibitory concentrations of inhibitor are picked and
purified by repeated restreaking. Their plasmids are purified and tested for the ability to
confer resistance to the inhibitor by retransforming them into the protox-lacking 
microbe. The DNA sequences of protox cDNA inserts from plasmids that pass this test 
are then determined. 

Once a herbicide resistant protox allele is identified, it may be genetically engineered 
for optimal expression in a crop plant. This may include altering the coding sequence 
of the resistance allele for optimal expression in the crop species of interest. Methods 
for modifying coding sequences to achieve optimal expression in a particular crop 
species are well known (see, e.g. Perlak et al., Proc. Natl. Acad. Sci. USA 88:3324
(1991); Koziel et al., Bio/technol. 11:194 (1993)). Genetically engineering the protox
allele for optimal expression may also include operably linking the appropriate 
regulatory sequences (i.e. promoter, signal sequence, transcriptional terminators). 
Preferred promoters will be those which confer high level constitutive expression or, 
more preferably, those which confer specific high level expression in the tissues 
susceptible to damage by the herbicide. 

The recombinant DNA molecules can be introduced into the plant cell in a number of 
art-recognized ways. Those skilled in the art will appreciate that the choice of method 
might depend on the type of plant, i.e. monocot or dicot, targeted for transformation.
Suitable methods of transforming plant cells include microinjection (Crossway et al.,
BioTechniques 4:320-334 (1986)), electroporation (Riggs et al., Proc. Natl. Acad. Sci.
USA 83:5602-5606 (1986)), Agrobacterium mediated transformation (Hinchee et al.,
Biotechnology 6:915-921 (1988)), direct gene transfer (Paszkowski et al., EMBO J.
3:2717-2722 (1984)), and ballistic particle acceleration using devices available from
Agracetus, Inc., Madison, Wisconsin and Dupont, Inc., Wilmington, Delaware (see, for
example, Sanford et al., U.S. Patent 4,945,050; and McCabe et al., Biotechnology
6:923-926 (1988)). Also see, Weissinger et al., Annual Rev. Genet. 22:421-477 (1988);
Sanford et al., Particulate Science and Technology 5:27-37 (1987) (onion); Christou et
al., Plant Physiol. 87:671-674 (1988) (soybean); McCabe et al., Bio/Technology
6:923-926 (1988) (soybean); Datta et al., Bio/Technology 8:736-740 (1990) (rice); Klein
et al., Proc. Natl. Acad. Sci. USA 85:4305-4309 (1988) (maize); Klein et al.,
Bio/Technology 6:559-563 (1988) (maize); Klein et al., Plant Physiol. 91:440-444
(1988) (maize); Fromm et al., Bio/Technology 8:833-839 (1990); and Gordon-Kamm et
al., Plant Cell 2:603-618 (1990) (maize).

Further comprised within the scope of the present invention are transgenic plants, in 
particular transgenic fertile plants transformed by means of the aforedescribed 
processes and their asexual and/or sexual progeny, which still are resistant or at least 
tolerant to inhibition by a herbicide at levels which normally are inhibitory to the 
naturally occurring protox activity in the plant. Very especially preferred are hybrid 
plants which are resistant or at least tolerant to inhibition by a herbicide at levels which 
normally are inhibitory to the naturally occurring protox activity in the plant. 

The transgenic plant according to the invention may be a dicotyledonous or a 
monocotyledonous plant. Preferred are monocotyledonous plants of the Graminaceae 
family involving Lolium, Zea, Triticum, Triticale, Sorghum, Saccharum, Bromus, Oryzae,
Avena, Hordeum, Secale and Setaria plants.

Especially preferred are transgenic maize, wheat, barley, sorghum, rye, oats, turf 
grasses and rice. 

Among the dicotyledonous plants soybean, cotton, tobacco, sugar beet, oilseed rape, 

The expression 'progeny' is understood to embrace both "asexually" and "sexually"
generated progeny of transgenic plants. This definition is also meant to include all
mutants and variants obtainable by means of known processes, such as for example 
cell fusion or mutant selection and which still exhibit the characteristic properties of the 
initial transformed plant, together with all crossing and fusion products of the 
transformed plant material.

Another object of the invention concerns the proliferation material of transgenic
plants. 

The proliferation material of transgenic plants is defined relative to the invention as 
any plant material that may be propagated sexually or asexually in vivo or in vitro. 
Particularly preferred within the scope of the present invention are protoplasts, cells, 
calli, tissues, organs, seeds, embryos, pollen, egg cells, zygotes, together with any 
other propagating material obtained from transgenic plants. 

Parts of plants, such as for example flowers, stems, fruits, leaves, roots originating in 
transgenic plants or their progeny previously transformed by means of the process of 
the invention and therefore consisting at least in part of transgenic cells, are also an 
object of the present invention. 

Before the plant propagation material (fruit, tuber, grains, seed), but especially seed, is
sold as a commercial product, it is customarily treated with a protectant coating
comprising herbicides, insecticides, fungicides, bactericides, nematicides, molluscicides 
or mixtures of several of these preparations, if desired together with further carriers, 
surfactants or application-promoting adjuvants customarily employed in the art of 
formulation to provide protection against damage caused by bacterial, fungal or animal 
pests. 

In order to treat the seed, the protectant coating may be applied to the seeds either by 
impregnating the tubers or grains with a liquid formulation or by coating them with a 
combined wet or dry formulation. In addition, in special cases, other methods of 
application to plants are possible, e.g. treatment directed at the buds or the fruit.

The plant seed according to the invention comprising a DNA sequence encoding a
protein from a eukaryote having protoporphyrinogen oxidase (protox) activity according
to the invention may be treated with a seed protectant coating comprising a seed
treatment compound, such as, for example, captan, carboxin, thiram (TMTD®),
metalaxyl (Apron®) and pirimiphos-methyl (Actellic®) and others that are commonly
used in seed treatment.

It is thus a further object of the present invention to provide plant propagation material
for cultivated plants, but especially plant seed, that is treated with a seed protectant
coating customarily used in seed treatment.
Where a herbicide resistant protox allele is obtained via direct selection in a crop
plant or plant cell culture from which a crop plant can be regenerated, it may be moved 
into commercial varieties using traditional breeding techniques to develop a herbicide 
tolerant crop without the need for genetically engineering the allele and transforming it 
into the plant. Alternatively, the herbicide resistant allele may be isolated, genetically 
engineered for optimal expression and then transformed into the desired variety. 

Genes encoding altered protox resistant to a protox inhibitor can also be used as 
selectable markers in plant cell transformation methods. For example, plants, plant 
tissue or plant cells transformed with a transgene can also be transformed with a gene 
encoding an altered protox capable of being expressed by the plant. The 
thus-transformed cells are transferred to medium containing the protox inhibitor wherein 
only the transformed cells will survive. Protox inhibitors contemplated to be particularly 
useful as selective agents are the diphenylethers {e.g. acifluorfen, 5-[2-chloro-4-
(trifluoromethyl)phenoxy]-2-nitrobenzoic acid; its methyl ester; or oxyfluorfen, 2-chloro-1-
(3-ethoxy-4-nitrophenoxy)-4-(trifluorobenzene)}, oxidiazoles (e.g. oxidiazon, 3-[2,4-
dichloro-5-(1-methylethoxy)phenyl]...),
cyclic imides (e.g. S-23142, N-(4-chloro-2-fluoro-5-propargyloxyphenyl)-3,4,5,6-
tetrahydrophthalimide; chlorophthalim, N-(4-chlorophenyl)-3,4,5,6-
tetrahydrophthalimide), phenyl pyrazoles (e.g. TNPP-ethyl, ethyl 2-[1-(2,3,4-
trichlorophenyl)-4-nitropyrazolyl-5-oxy]propionate; M&B 39279), pyridine derivatives
(e.g. LS 82-556), and phenopylate and its O-phenylpyrrolidino- and
piperidinocarbamate analogs. The method is applicable to any plant cell capable of
being transformed with an altered protox-encoding gene, and can be used with any 
transgene of interest. Expression of the transgene and the protox gene can be driven
by the same promoter functional in plant cells, or by separate promoters.

The invention will be further described by reference to the following detailed 
examples. These examples are provided for purposes of illustration only, and are not 
intended to be limiting unless otherwise specified. 
DEPOSITS

The following vector molecules have been deposited with the Agricultural Research
Service, Patent Culture Collection (NRRL), Northern Regional Research Center, 1815
North University Street, Peoria, Illinois 61604, U.S.A., on the dates indicated below:

Protox-1, in the pBluescript SK vector, was deposited April 5, 1994 as pWDC-2 (NRRL
#B-21238).

Protox-2, in the pFL61 vector, was deposited April 5, 1994 as pWDC-1 (NRRL #B- 
21237). 

MzProtox-1, in the pBluescript SK vector, deposited May 20, 1994 as pWDC-4 with the 
NRRL (#B-21260), shown in SEQ ID NO:5. 

MzProtox-1, in the pBluescript SK vector, redeposited July 11, 1994 as pWDC-4 with the
NRRL (#B-21260N), shown in SEQ ID NO:5. 

MzProtox-2, in the pBluescript SK vector, deposited May 20, 1994 as pWDC-3 with the 
NRRL (#B-21259), shown in SEQ ID NO:7. 

Protox-3, in the pFL61 vector, was deposited June 10, 1994 as pWDC-5 
(NRRL #B-21280). 

pMzC-1 Val, in the pBluescript SK vector, was deposited on September 30, 1994 
under the designation pWDC-8 and given the deposit designation NRRL #21340. 

pAraC-2Cys, in the pFL61 vector, was deposited on November 14, 1994 under the 
designation pWDC-7 and given the deposit designation NRRL #21339N. 



EXAMPLES 

Standard recombinant DNA and molecular cloning techniques used here are well
known in the art and are described by T. Maniatis, E. F. Fritsch and J. Sambrook,
Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring
Harbor, NY (1982) and by T.J. Silhavy, M.L. Berman, and L.W. Enquist, Experiments
with Gene Fusions, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1984).

EXAMPLE 1: Isolation of Arabidopsis cDNAs encoding protox genes by
functional complementation of an E. coli mutant.

An Arabidopsis thaliana (Landsberg) cDNA library in the plasmid vector pFL61
(Minet et al., Plant J. 2:417-422 (1992)) was obtained and amplified. A second
Arabidopsis (Columbia) cDNA library in the UniZap lambda vector (Stratagene) was
purchased and amplified as pBluescript plasmids by mass in vivo excision of the phage
stock. The E. coli hemG mutant SASX38 (Sasarman et al., J. Gen. Microbiol. 113:297
(1979)) was obtained and maintained on L media containing 20 µg/ml hematin (United
States Biochemicals). The plasmid libraries were transformed into SASX38 by
electroporation using the Bio-Rad Gene Pulser and the manufacturer's conditions. The
cells were plated on L agar containing 100 µg/ml ampicillin at a density of
approximately 500,000 transformants/10 cm plate. The cells were incubated at 37°C
for 40 hours in low light and selected for the ability to grow without the addition of
exogenous heme. Heme prototrophs were recovered at a frequency of 400/10^7 from
the pFL61 library and at a frequency of 2/10^7 from the pBluescript library. Plasmid
DNA was isolated from 24 colonies for sequence analysis. Each of the 24 was
retransformed into SASX38 to verify ability to complement.
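
The recovery frequencies quoted above translate directly into an expected number of heme prototrophs per plate. The following is a minimal sketch of that arithmetic, assuming the stated plating density of about 500,000 transformants per plate; the function names are illustrative only and are not part of the original protocol.

```python
def prototroph_frequency(prototrophs: float, transformants: float) -> float:
    """Fraction of transformants that grow without added heme."""
    return prototrophs / transformants


def expected_per_plate(frequency: float, transformants_per_plate: float) -> float:
    """Expected number of prototrophic colonies on one plate."""
    return frequency * transformants_per_plate


# Frequencies reported in Example 1 (per 10^7 transformants).
pfl61 = prototroph_frequency(400, 1e7)
pbluescript = prototroph_frequency(2, 1e7)

# Plating density reported in Example 1 (hypothetical exact value).
PER_PLATE = 500_000

print("pFL61 library, colonies per plate:", expected_per_plate(pfl61, PER_PLATE))
print("pBluescript library, colonies per plate:", expected_per_plate(pbluescript, PER_PLATE))
```

Under these figures the pFL61 library is expected to give on the order of twenty complementing colonies per plate, while the pBluescript library gives roughly one colony per ten plates, which is why many more plates are needed to recover clones from the latter.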

Sequence analysis revealed two classes of putative protox clones. Nine were of the
type designated "Protox-1". Each was derived from the same gene, and two were
full-length clones. The cDNA is 1719 bp in length and encodes a protein of molecular
weight 57.7 kDa. The N-terminal peptide sequence has features characteristic of a
chloroplast transit peptide of approximately 60 amino acids. A database search with
the GAP program (Deveraux et al., Nucleic Acids Res. 12:387-395 (1984)) reveals
homology with the B. subtilis hemY (protox) protein (Hansson and Hederstedt 1992,
Dailey et al., J. Biol. Chem. 269: 813 (1994)). The two proteins are 53% similar, 31%
identical with regions of high homology, including the proposed dinucleotide binding
domain of the hemY protein (Dailey et al., J. Biol. Chem. 269: 813 (1994)).

The other 15 cDNA clones were of the type designated "Protox-2". These also
length and encodes a protein of molecular weight 55.6 kDa. The amino terminus is
somewhat characteristic of a mitochondrial transit peptide. The Protox-2 protein has
limited homology to Protox-1 (53% similar, 28% identical) and to the B. subtilis protox
(50% similar, 27% identical).

Protox-1, in the pBluescript SK vector, was deposited April 5, 1994 as pWDC-2
(NRRL #B-21238).

Protox-2, in the pFL61 vector, was deposited April 5, 1994 as pWDC-1 (NRRL
#B-21237).

The Arabidopsis cDNA encoding protox-1 contained in pWDC-2 and protox-2
contained in pWDC-1 are set forth in SEQ ID NOS:1 and 3, respectively, below.

EXAMPLE 2: Isolation of Maize cDNAs encoding protox genes by functional 
complementation of an E. coli mutant. 

A Zea mays (B73 inbred) cDNA library in lambda UniZap was purchased from
Stratagene and converted to a pBluescript library by mass in vivo excision. A second 
custom-made UniZap maize cDNA library was purchased from Clontech, and similarly 
converted to pBluescript plasmids. Selection for functional protox genes from maize 
was just as described for the Arabidopsis libraries above in Example 1. 

Two heme prototrophs in 10^7 transformants were isolated from the Stratagene
library, shown to recomplement and sequenced. These cDNAs were identical and
proved to be homologs of Arabidopsis Protox-1. This maize clone, designated
MzProtox-1, is incomplete. The cDNA is 1698 bp in length and codes only for the
putative mature protox enzyme; there is no transit peptide sequence and no initiating
methionine codon. The gene is 68% identical to Arab Protox-1 at the nucleotide level
and 78% identical (87% similar) at the amino acid level (shown in Table 1).

A single heme prototroph in 10^7 transformants was obtained from the Clontech
library, shown to recomplement, and sequenced. The cDNA appears to be complete, is
2061 bp in length and encodes a protein of 59 kDa. This clone is a maize homolog of
Arabidopsis Protox-2 and is designated MzProtox-2. The gene is 58% identical to Arab
Protox-2 at the nucleotide level and 58% identical (76% similar) at the amino acid level
(shown in Table 2). The maize clone has an N-terminal sequence that is 30 amino
acids longer than the Arabidopsis clone. As with the Arabidopsis clones, homology
between the two maize protox genes is quite low, with only 31% identity between the
two protein sequences.

MzProtox-1, in the pBluescript SK vector, deposited May 20, 1994 as pWDC-4 with the
NRRL (#B-21260), shown in SEQ ID NO:5.

MzProtox-1, in the pBluescript SK vector, redeposited July 11, 1994 as pWDC-4
with the NRRL (#B-21260N), shown in SEQ ID NO:5.

MzProtox-2, in the pBluescript SK vector, deposited May 20, 1994 as pWDC-3 with the 
NRRL (#B-21259), shown in SEQ ID NO:7. 



EXAMPLE 3: Isolation of additional protox genes based on sequence 
homology to known protox coding sequences 

A phage or plasmid library is plated at a density of approximately 10,000 plaques on
a 10 cm Petri dish, and filter lifts of the plaques are made after overnight growth of the
plates at 37°C. The plaque lifts are probed with one of the cDNAs set forth in SEQ ID
NOS:1, 3, 5 or 7, labeled with 32P-dCTP by the random priming method by means of a
PrimeTime kit (International Biotechnologies, Inc., New Haven, CT). Hybridization
conditions are 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO4 pH 7.0, 1 mM EDTA at
50°C. After hybridization overnight, the filters are washed with 2X SSC, 1% SDS.
Positively hybridizing plaques are detected by autoradiography. After purification to 
single plaques, cDNA inserts are isolated, and their sequences determined by the chain 
termination method using dideoxy terminators labeled with fluorescent dyes (Applied 
Biosystems, Inc., Foster City, CA). 

The standard experimental protocol described above can be used by one of skill in
the art to obtain protox genes sequentially homologous to the known protox coding
sequences from any other eukaryote, particularly other higher plant species.

An alignment of the predicted amino acid sequences of the respective proteins
encoded by the sequences shown in SEQ ID NOS: 2 and 6 is set forth in Table 1. An
alignment of the predicted amino acid sequences of the respective proteins encoded by
the sequences shown in SEQ ID NOS: 4 and 8 is set forth in Table 2.
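
As an illustration of how a percent identity figure can be derived from a gapped alignment such as those in Tables 1 and 2, the sketch below counts identical residues over the aligned (non-gap) columns. It is not the GAP program itself: the choice of denominator (columns in which neither sequence has a gap) and the function name are assumptions made for this example. The two fragments are the Arabidopsis residues 199-248 and maize residues 145-194 rows of Table 1.

```python
def percent_identity(seq_a: str, seq_b: str, gap: str = ".") -> float:
    """Percent of aligned (non-gap) columns in which both residues are identical."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have the same length")
    aligned = 0
    matches = 0
    for a, b in zip(seq_a, seq_b):
        if a == gap or b == gap:
            continue  # skip columns where either sequence has a gap
        aligned += 1
        if a == b:
            matches += 1
    if aligned == 0:
        raise ValueError("no aligned columns to compare")
    return 100.0 * matches / aligned


# One 50-column block taken from Table 1 (no gaps in this block).
arabidopsis = "RRNLGDEVFERLIEPFCSGVYAGDPSKLSMKAAFGKVWKLEQNGGSIIGG"
maize = "RRNLGAEVFERLIEPFCSGVYAGDPSKLSMKAAFGKVWRLEETGGSIIGG"

print(percent_identity(arabidopsis, maize))
```

In this block 46 of the 50 aligned positions are identical, so the local value is higher than the whole-protein figure; the percentages quoted in the tables are computed over the full alignment.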



TABLE 1

Comparison of the Arabidopsis (SEQ ID No. 2) and
Maize (SEQ ID No. 6) Protox-1 Amino Acid Sequences

Percent Similarity: 87.137 Percent Identity: 78.008
Protox-1.Pep x Mzprotox-1.Pep

 51 GGTTITTDCVIVGGGISGLCIAQALATKHPDAAPNLIVTEAKDRVGGNII 100
           ||| ||||||||| |||||| |        ||||  | ||||
  1 ....NSADCVVVGGGISGLCTAQALATRH..GVGDVLVTEARARPGGNIT 44

101 T..REENGFLWEEGPNSFQPSDPMLTMVVDSGLKDDLVLGDPTAPRFVLW 148
    |  | | | |||||||||||||| ||| |||||||||| ||| |||||||
 45 TVERPEEGYLWEEGPNSFQPSDPVLTMAVDSGLKDDLVFGDPNAPRFVLW 94

149 NGKLRPVPSKLTDLPFFDLMSIGGKIRAGFGALGIRPSPPGREESVEEFV 198
     |||||||||  |||||||||| || ||| ||||||| ||||||||||||
 95 EGKLRPVPSKPADLPFFDLMSIPGKLRAGLGALGIRPPPPGREESVEEFV 144

199 RRNLGDEVFERLIEPFCSGVYAGDPSKLSMKAAFGKVWKLEQNGGSIIGG 248
    ||||| |||||||||||||||||||||||||||||||| ||  |||||||
145 RRNLGAEVFERLIEPFCSGVYAGDPSKLSMKAAFGKVWRLEETGGSIIGG 194

249 TFKAIQERKNAPKAERDPRLPKPQGQTVGSFRKGLRMLPEAISARLGSKV 298
    | | ||||   ||  || ||||| |||| |||||| ||| ||   |||||
195 TIKTIQERSKNPKPPRDARLPKPKGQTVASFRKGLAMLPNAITSSLGSKV 244

299 KLSWKLSGITKLESGGYNLTYETPDGLVSVQSKSVVMTVPSHVASGLLRP 348
    ||||||  |||    || | |||| | |||| ||| || || |||  |||
245 KLSWKLTSITKSDDKGYVLEYETPEGVVSVQAKSVIMTIPSYVASNILRP 294

349 LSESAANALSKLYYPPVAAVSISYPKEAIRTECLIDGELKGFGQLHPRTQ 398
    ||  || |||  ||||||||  |||||||| |||||||| |||||||| |
295 LSSDAADALSRFYYPPVAAVTVSYPKEAIRKECLIDGELQGFGQLHPRSQ 344

399 GVETLGTIYSSSLFPNRAPPGRILLLNYIGGSTNTGILSKSEGELVEAVD 448
    ||||||||||||||||||| || |||||||| ||||| || | |||||||
345 GVETLGTIYSSSLFPNRAPDGRVLLLNYIGGATNTGIVSKTESELVEAVD 394

449 RDLRKMLIKPNSTDPLKLGVRVWPQAIPQFLVGHFDILDTAKSSLTSSGY 498
    ||||||||     ||| ||||||||||||||||| | |  ||  |   ||
395 RDLRKMLINSTAVDPLVLGVRVWPQAIPQFLVGHLDLLEAAKAALDRGGY 444

499 EGLFLGGNYVAGVALGRCVEGAYETAIEVNNFMSRYAYK* 538
     ||||||||||||||||||||||| |     |   |||||
445 DGLFLGGNYVAGVALGRCVEGAYESASQISDFLTKYAYK* 484

Identical residues are denoted by the vertical bar between the two sequences.
Alignment is performed using the GAP program described in Deveraux et al., Nucleic
Acids Res. 12:387-395 (1984).






TABLE 2

Comparison of the Arabidopsis (SEQ ID No. 4) and
Maize (SEQ ID NO. 8) Protox-2 Amino Acid Sequences

Percent Similarity: 75.889 Percent Identity: 57.905
Protox-2.Pep x Mzprotox-2.Pep

  1 ............................MASGAVAD.HQIEAVSGKRVAV 21
                                 |  | |      |     |||
  1 MLALTASASSASSHPYRHASAHTRRPRLRAVLAMAGSDDPRAAPARSVAV 50

 22 VGAGVSGLAAAYKLKSRGLNVTVFEADGRVGGKLRSVMQNGLIWDEGANT 71
    |||||||||||| |   | |||||||  | ||| |     |  |||||||
 51 VGAGVSGLAAAYRLRQSGVNVTVFEAADRAGGKIRTNSEGGFVWDEGANT 100

 72 MTEAEPEVGSLLDDLGLREKQQFPISQKKRYIVRNGVPVMLPTNPIELVT 121
    ||| | |   | |||||  ||| | || |||||  | |   |  || |
101 MTEGEWEASRLIDDLGLQDKQQYPNSQHKRYIVKDGAPALIPSDPISLMK 150

122 SSVLSTQSKFQILLEPFLWKK....KSSKVSDASAEESVSEFFQRHFGQE 167
    |||||| ||     |||| ||     | |||     |||  |  |||| |
151 SSVLSTKSKIALFFEPFLYKKANTRNSGKVSEEHLSESVGSFCERHFGRE 200

168 VVDYLIDPFVGGTSAADPDSLSMKHSFPDLWNVEKSFGSIIVGAIRTKFA 217
    ||||  |||| |||| || |||  | || ||| |   || |||||  | |
201 VVDYFVDPFVAGTSAGDPESLSIRHAFPALWNLERKYGSVIVGAILSKLA 250

218 AKGGKSRDTKSSPGTKKGSRGSFSFKGGMQILPDTLCKSLSHDEINLDSK 267
    |||        | |     | |||| |||| |   |      |   |
251 AKGDPVKTRHDSSGKRRNRRVSFSFHGGMQSLINALHNEVGDDNVKLGTE 300

268 VLSLS..YNSGSRQENWSLSCVSHNETQRQ...NPHYDAVIMTAPLCNVK 312
    ||||            || |  |          |   ||||||||| ||
301 VLSLACTFDGVPALGRWSISVDSKDSGDKDLASNQTFDAVIMTAPLSNVR 350

313 EMKVMKGGQPFQLNFLPEINYMPLSVLITTFTKEKVKRPLEGFGVLIPSK 362
     ||  ||| |  | |||   | |||   | | |  || |||||||||| |
351 RMKFTKGGAPVVLDFLPKMDYLPLSLMVTAFKKDDVKKPLEGFGVLIPYK 400

363 E.QKHGFKTLGTLFSSMMFPDRSPSDVHLYTTFIGGSRNQELAKASTDEL 411
    | |||| ||||||||||||||| | |  ||||| ||| |  || | |  |
401 EQQKHGLKTLGTLFSSMMFPDRAPDDQYLYTTFVGGSHNRDLAGAPTSIL 450

412 KQVVTSDLQRLLGVEGEPVSVNHYYWRKAFPLYDSSYDSVMEAIDKMEND 461
    || |||||  |||||| |  | | ||  |||||   | || ||| |||
451 KQLVTSDLKKLLGVEGQPTFVKHVYWGNAFPLYGHDYSSVLEAIEKMEKN 500

462 LPGFFYAGNHRGGLSVGKSIASGCKAADLVISYLESCSNDKKPNDSL* 509
    |||||||||   || ||  |||| ||||| ||||||
501 LPGFFYAGNSKDGLAVGSVIASGSKAADLAISYLESHTKHNNSH*... 545



EXAMPLE 4: Isolation of a contaminating yeast Protox clone from an Arabidopsis 
cDNA library 

In an effort to identify any rare cDNAs with protox activity, a second screen of the 
pFL61 Arabidopsis library was done as before, again yielding hundreds of 
complementing clones. Approximately 600 of these were patched individually onto 
gridded plates and incubated at 28°C for 18 hours. Duplicate filter lifts were made onto 
Colony/Plaque screen (NEN) membranes according to the manufacturer's instructions. 
The Protox-1 and Protox-2 cDNAs were removed from their vectors by digestion with
EcoRI/XhoI and by NotI, respectively. The inserts were separated by gel
electrophoresis in 1.0% SeaPlaque GTG (FMC) agarose, excised, and 32P-labeled by
random priming (Life Technologies). One set of lifts was hybridized with each probe.
Hybridization and wash conditions were as described in Church and Gilbert, 1984.

Colonies (~20) that failed to show clear hybridization to Protox-1 or Protox-2 were
amplified in liquid culture and plasmid DNA was prepared. The DNAs were digested
with NotI, duplicate samples were run on a 1.0% agarose gel, and then Southern
blotted onto a Gene Screen Plus (NEN) filter [New England Nuclear]. Probes of the
two known Protox genes were labeled and hybridized as before. There were two
identical clones that were not Protox-1 or Protox-2. This clone was shown to
recomplement the SASX38 mutant, although it grows very slowly, and was designated
Protox-3.

Protox-3, in the pFL61 vector, was deposited June 8, 1994 as pWDC-5 
(NRRL #B-21280). This coding sequence has been determined to be derived 
from yeast DNA which was present as a minor contaminant in the Arabidopsis 
cDNA library. The yeast DNA encoding protox-3 contained in pWDC-5 is set 
forth in SEQ ID NO:9 below. 
EXAMPLE 5: Demonstration of plant protox clone sensitivity to protox
inhibitory herbicides in a bacterial system.

Liquid cultures of Protox-1/SASX38, Protox-2/SASX38 and pBluescript/XL1-Blue
were grown in L amp100. One hundred microliter aliquots of each culture were plated
on L amp100 media containing various concentrations (1.0 nM-10 mM) of a protox
inhibitory aryluracil herbicide of formula XVII. Duplicate sets of plates were incubated
for 18 hours at 37°C in either low light or complete darkness.

The protox+ E. coli strain XL1-Blue showed no sensitivity to the herbicide at any
concentration, consistent with reported resistance of the native bacterial enzyme to
similar herbicides. The Protox-1/SASX38 was clearly sensitive, with the lawn of
bacteria almost entirely eliminated by inhibitor concentrations as low as 10 nM. The
Protox-2/SASX38 was also sensitive, but only at a higher concentration (10 µM) of the
herbicide. The effect of the herbicide on both plant protox strains was most dramatic in
low light, but was also apparent on plates maintained entirely in the dark. The toxicity
of the herbicide was entirely eliminated by the addition of 20 µg/ml hematin to the
plates.

The different herbicide tolerance between the two plant Protox strains is likely the 
result of differential expression from these two plasmids, rather than any inherent 
difference in enzyme sensitivity. Protox-1/SASX38 grows much more slowly than 
Protox-2/SASX38 in any heme-deficient media. In addition, the MzProtox-2/SASX38
strain, with a growth rate comparable to Arab Protox-1/SASX38, is also very sensitive
to herbicide at the lower (10-100 nM) concentrations. Initial characterization of the yeast
Protox-3 clone indicated that it also is herbicide sensitive.



EXAMPLE 6: Selecting for plant protox genes resistant to protox-inhibitory
herbicides in the E. coli expression system.

Inhibition of plant protox enzymes in a bacterial system is useful for large-scale
screening for herbicide-resistant mutations in the plant genes. Initial dose response
experiments, done by plating from liquid cultures, gave rise to high frequency "resistant"
colonies even at high concentrations of herbicide. This resistance was not plasmid-borne,
based on retransformation/herbicide sensitivity assay. Transforming Protox
plasmids into the SASX38 mutant and plating directly onto plates containing herbicide
reduces this background problem almost entirely.

The plant protox plasmids are mutagenized in a variety of ways, using published
procedures for chemical (e.g. sodium bisulfite (Shortle et al., Methods Enzymol.
100:457-468 (1983)); methoxylamine (Kadonaga et al., Nucleic Acids Res.
13:1733-1745 (1985))); oligonucleotide-directed saturation mutagenesis (Hutchinson et
al., Proc. Natl. Acad. Sci. USA, 83:710-714 (1986)); or various polymerase
misincorporation strategies (see, e.g. Shortle et al., Proc. Natl. Acad. Sci. USA,
79:1588-1592 (1982); Shiraishi et al., Gene 54:313-319 (1988); and Leung et al.,
Technique 1:11-15 (1989)). The expected up-promoter mutants from whole-plasmid
mutagenesis are eliminated by recloning the coding sequence into a wild-type vector
and retesting. Given that higher expression is likely to lead to better growth in the
absence of herbicide, a visual screen for coding sequence mutants is also possible.

Any plant protox gene expressing herbicide resistance in the bacterial system may 
be engineered for optimal expression and transformed into plants using standard 
techniques as described herein. The resulting plants may then be treated with 
herbicide to confirm and quantitate the level of resistance conferred by the introduced 
protox gene. 

EXAMPLE 7: Constructs for Expression of herbicide-resistant microbial protox 
gene(s) in plants. 

The coding sequences for the B. subtilis protox gene hemY (Hansson and
Hederstedt, J. Bacteriol. 174: 8081 (1992); Dailey et al., J. Biol. Chem. 269: 813
(1994)) and for the E. coli protox gene hemG (Sasarman et al., Can. J. Microbiol. 39:
1155 (1993)) were isolated from laboratory strains by PCR amplification using standard
conditions and flanking primers designed from the published sequences. These genes
are known to code for herbicide-resistant forms of the protox enzyme.

Using standard techniques of overlapping PCR fusion (Ausubel et al., Current
Protocols in Molecular Biology, John Wiley & Sons, Inc. (1994)), both bacterial genes
were fused to two different Arabidopsis chloroplast transit peptide sequences (CTPs).
The first was the CTP from the acetohydroxy acid synthase (AHAS, Mazur et al., Plant
The second was from the Arabidopsis plastocyanin gene (Vorst et al., Gene 65: 59
(1988)), which has a bipartite transit peptide. The amino terminal portion of this CTP
targets the protein into the chloroplast, where the carboxy terminus routes it into the
thylakoid membranes. All four gene fusions were cloned behind the 2X35S promoter in
a binary expression vector designed for production of transgenic plants by
Agrobacterium transformation.

Following isolation of the Arabidopsis and maize protox cDNAs, the chloroplast
transit peptide from Protox-1 or MzProtox-1 may also be fused to the two bacterial
protox proteins in the same manner as above.

The vectors described above may then be transformed into the desired plant 
species and the resulting transformants assayed for increased resistance to herbicide. 

EXAMPLE 8: Domain switching between Arabidopsis/B. subtilis genes to
produce chimeric, herbicide resistant protox.

One approach that may be used to generate a protox gene which is both herbicide
resistant and capable of providing effective protox enzymatic activity in a plant is to
fuse portion(s) of a bacterial and plant protox gene. The resulting chimeric genes may
then be screened for those which are capable of providing herbicide resistant protox
activity in a plant cell. For instance, the Arabidopsis and the B. subtilis (hemY) protox
peptide sequences are reasonably colinear with regions of high homology. The hemY
coding sequence is cloned into pBluescript and tested for its ability to express
herbicide-resistant protox activity in SASX38. Protox-1/hemY chimeric genes are
constructed using fusion PCR techniques, followed by ligation back into the pBluescript
vector. The initial exchange is approximately in the middle of the proteins. These
fusions are tested for protox function by complementation, and then assayed for
herbicide resistance by plating on herbicide with intact Protox-1 and hemY controls.

EXAMPLE 9: Production of herbicide-tolerant plants by overexpression of 
plant protox genes. 

To express the Arabidopsis or maize protein in transgenic plants, the appropriate full
length cDNA was inserted into the plant expression vector pCGN1761ENX, which was
derived from pCGN1761 as follows. pCGN1761 was digested at its unique EcoRI site
and ligated to a double-stranded DNA fragment comprised of two oligonucleotides of
sequence 5' AAT TAT GAC GTA ACG TAG GAA TTA GCG GCCC GCT CTC GAG T 3'
(SEQ ID NO: 11) and 5' AAT TAG TCG AGA GCG GCC GCG AAT TCC TAG GTT ACG
TCA T 3' (SEQ ID NO: 12). The resulting plasmid, pCGN1761ENX, contained unique
EcoRI, NotI, and XhoI sites that lie between a duplicated 35S promoter from cauliflower
mosaic virus (Kay et al., Science 236:1299-1302 (1987)) and the 3' untranslated
sequences of the tml gene of Agrobacterium tumefaciens. This plasmid is digested and
ligated to a fragment resulting from restriction enzyme digestion of one of the plasmids
bearing a protox cDNA, such that it carries the complete protox cDNA. From this
plasmid is excised an XbaI fragment comprising the Arabidopsis protox cDNA flanked
by a duplicated 35S promoter and the 3' untranslated sequences of the tml gene of A.
tumefaciens. This XbaI fragment is inserted into the binary vector pCIB200 at its
unique XbaI site, which lies between T-DNA border sequences. The resulting plasmid,
designated pCIB200protox, is transformed into A. tumefaciens strain CIB542. See, e.g.,
Uknes et al., Plant Cell 5:159-169 (1993).

Leaf disks of Nicotiana tabacum cv. Xanthi-nc are infected with A. tumefaciens 
CIB542 harboring pCIB200protox as described by Horsch et al., Science 227: 1229
(1985). Kanamycin-resistant shoots from 15 independent leaf disks are transferred to 
rooting medium, then transplanted to soil and the resulting plants grown to maturity in 
the greenhouse. Seed from these plants are collected and germinated on MS agar 
medium containing kanamycin. Multiple individual kanamycin resistant seedlings from 
each independent primary transformant are grown to maturity in the greenhouse, and 
their seed collected. These seeds are germinated on MS agar medium containing 
kanamycin. 

Plant lines that give rise to exclusively kanamycin resistant seedlings are 
homozygous for the inserted gene and are subjected to further analysis. Leaf disks of 
each of the 15 independent transgenic lines are excised with a paper punch and placed 
onto MS agar containing various increasing concentrations of a protox inhibitory 
herbicide. 
After three weeks, two sets of 10 disks from each line are weighed, and the results
recorded. Transgenic lines more resistant to the inhibitor than wild type,
non-transformed plants are selected for further analysis.

RNA is extracted from leaves of each of these lines. Total RNA from each 
independent homozygous line, and from non-transgenic control plants, is separated by 
agarose gel electrophoresis in the presence of formaldehyde (Ausubel et al., Current
Protocols in Molecular Biology, Wiley & Sons, New York (1987)). The gel is blotted to
nylon membrane (Ausubel et al., supra.) and hybridized with the radiolabeled
Arabidopsis protox cDNA. Hybridization and washing conditions are as described by
Church and Gilbert, Proc. Natl. Acad. Sci. USA 81:1991-1995 (1984). The filter is
autoradiographed, and intense RNA bands corresponding to the protox transgene are 
detected in all herbicide-tolerant transgenic plant lines. 

To further evaluate resistance of the protox-overexpressing line, plants are grown in
the greenhouse and treated with various concentrations of a protox-inhibiting herbicide.

EXAMPLE 10: Growth of tobacco cell suspension cultures 

Media : 

MX1: This medium consists of Murashige and Skoog ("MS", T. Murashige et al.,
Physiol. Plant. 15:473-497, 1962) major salts, minor salts and Fe-EDTA (Gibco #
500-1117; 4.3 g/l), 100 mg/l myo-inositol, 1 mg/l nicotinic acid, 1 mg/l pyridoxine-HCl,
10 mg/l thiamine-HCl, 2-3 g/l sucrose, 0.4 mg/l 2,4-dichlorophenoxyacetic acid, and
0.04 mg/l kinetin, pH 5.8. The medium is sterilized by autoclaving.



N6: This medium comprises macroelements, microelements and Fe-EDTA as
described by C.-C. Chu et al., Scientia Sinica 18:659 (1975), and the following organic
compounds: pyridoxine-HCl (0.5 mg/l), thiamine-HCl (0.1 mg/l), nicotinic acid (0.5
mg/l), glycine (2.0 mg/l), and sucrose (30.0 g/l). The solution is autoclaved. The final
pH is 5.6.
Remarks: Macroelements are made up as a 10 X concentrated stock solution, and
microelements as a 1000 X concentrated stock solution. Vitamin stock solution is 
normally prepared 100 X concentrated. 
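
The stock concentrations given in the remarks determine how much of each stock goes into a batch of medium: the volume of stock is the final volume divided by the concentration factor. A minimal sketch of that calculation follows; the dictionary keys and function names are illustrative, and the one-litre batch size is an assumption chosen for the example.

```python
# Concentration factors stated in the remarks above.
STOCK_FOLD = {
    "macroelements": 10,    # 10 X stock
    "microelements": 1000,  # 1000 X stock
    "vitamins": 100,        # 100 X stock
}


def stock_volume_ml(final_volume_ml: float, fold: float) -> float:
    """Volume of an N-fold concentrated stock needed for the final volume."""
    return final_volume_ml / fold


def stock_volumes(final_volume_ml: float) -> dict:
    """Volumes of every stock solution for one batch of medium."""
    return {name: stock_volume_ml(final_volume_ml, fold)
            for name, fold in STOCK_FOLD.items()}


# Hypothetical batch size: 1 litre of medium.
for name, volume in stock_volumes(1000.0).items():
    print(f"{name}: {volume:g} ml of stock")
```

For one litre this gives 100 ml of macroelement stock, 1 ml of microelement stock and 10 ml of vitamin stock, with the remaining components and water added to volume.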

Suspension cultured cells of Nicotiana tabacum, line S3 [Harms and DiMaio, J Plant
Physiol 137, 513-519, 1991] are grown in liquid culture medium MX1. 100 ml
Erlenmeyer flasks containing 25 ml medium MX1 are inoculated with 10 ml of a cell
culture previously grown for 7 days. Cells are incubated at 25°C in the dark on an
orbital shaker at 100 rpm (2 cm throw). Cells are subcultured at 7 day intervals by
inoculating an aliquot sample into fresh medium, by decanting or pipetting off around
90% of the cell suspension followed by replenishing fresh medium to give the desired
volume of suspension. 5-8 grams of fresh weight cell mass are produced within 10
days of growth from an inoculum of 250-350 mg cells.
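
The yield figures above can be turned into a rough fold increase and doubling time. The sketch below assumes simple exponential growth over the whole 10 day period, which the text does not state and which real suspension cultures only approximate (lag and stationary phases are ignored); the function names are illustrative.

```python
import math


def fold_increase(final_g: float, inoculum_g: float) -> float:
    """Ratio of final fresh weight to inoculum fresh weight."""
    return final_g / inoculum_g


def doubling_time_days(final_g: float, inoculum_g: float, days: float) -> float:
    """Doubling time under the assumption of purely exponential growth."""
    return days * math.log(2) / math.log(fold_increase(final_g, inoculum_g))


# Extremes of the ranges quoted in the text: 5-8 g from 0.25-0.35 g in 10 days.
slowest = doubling_time_days(5.0, 0.35, 10)  # smallest fold increase
fastest = doubling_time_days(8.0, 0.25, 10)  # largest fold increase (32-fold)

print(f"fold increase: {fold_increase(5.0, 0.35):.1f} to {fold_increase(8.0, 0.25):.1f}")
print(f"doubling time: {fastest:.1f} to {slowest:.1f} days")
```

On these assumptions the culture doubles roughly every two to three days, which is consistent with subculturing at weekly intervals after removing most of the suspension.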

EXAMPLE 11: Production of tobacco cell cultures tolerant to herbicidal protox
inhibitors by plating cells on solidified selection medium

Cells are pregrown as in Example 10. Cells are harvested by allowing cells to
sediment, or by brief centrifugation at 500 x g, and the spent culture medium is
removed. Cells are then diluted with fresh culture medium to give a cell density suitable
for cell plating, about 10,000 colony forming units per ml. For plating, cells in a small
volume of medium (approx. 1 ml) are evenly spread on top of solidified culture medium
(MX1, 0.8% agar) containing the desired concentration of the inhibitor. About 20-30 ml
of medium are used per 10 cm Petri plate. The suitable inhibitor concentration is
determined from a dose-response curve (Example 14), and is at least twofold higher
than the IC50 of sensitive wild-type cells.
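
The dilution step that brings harvested cells to the plating density of about 10,000 colony forming units per ml can be worked out as below. This is a sketch only: the measured density and starting volume are hypothetical inputs chosen for illustration, and the function name is not taken from the text.

```python
def dilution_plan(measured_cfu_per_ml: float,
                  suspension_ml: float,
                  target_cfu_per_ml: float = 10_000):
    """Return (dilution factor, ml of fresh medium to add) to reach the target density."""
    if measured_cfu_per_ml < target_cfu_per_ml:
        raise ValueError("suspension is already below the target density")
    factor = measured_cfu_per_ml / target_cfu_per_ml
    final_volume_ml = suspension_ml * factor
    return factor, final_volume_ml - suspension_ml


# Hypothetical example: 2 ml of harvested cells counted at 250,000 cfu/ml.
factor, medium_to_add_ml = dilution_plan(250_000, 2.0)
print(f"dilute {factor:g}-fold by adding {medium_to_add_ml:g} ml of fresh medium")
```

With approximately 1 ml of the diluted suspension spread per plate, each 10 cm plate then receives on the order of 10,000 colony forming units.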

Culture plates containing cells spread onto selection medium are incubated under 
normal growth conditions at 25-28°C in the dark until cell colonies are formed.
Emerging cell colonies are transferred to fresh medium containing the inhibitor in the 
desired concentration. 

In a preferred modification of the described method the pregrown suspension of
cultured cells is first spread in a small volume of liquid medium on top of the solidified
around 40°C is added and the plate gently but immediately swirled to spread the cells
evenly over the medium surface and to mix cells and agar medium, before the medium
solidifies.

Alternatively, the cells are mixed with the molten agar medium prior to spreading on 
top of the selection medium. This method has the advantage that the cells are 
embedded and immobilized in a thin layer of solidified medium on top of the selection 
medium. It allows for better aeration of the cells as compared to embedding cells in the 
whole volume of 20-30 ml. 

EXAMPLE 12: Production of tobacco cell cultures tolerant to a herbicidal protox 
inhibitor by growing cells in liquid selection medium 

Cells cultured as in Example 10 are inoculated at a suitable cell density into liquid 
medium MX1 containing the desired concentration of a herbicidal protox inhibitor. Cells 
are incubated and grown as in Example 10. Cells are subcultured, as appropriate 
depending on the rate of growth, using fresh medium containing the desired inhibitor 
concentration after a period of 7-10 days. 

Depending on the inhibitor concentration used, cell growth may be slower than in the 
absence of inhibitor. 

EXAMPLE 13: Production of tobacco cells with enhanced levels of protox 
enzyme 

In order to obtain cell cultures or callus with enhanced levels of protox enzyme, 
suspension cultures or callus are transferred, in a step-wise manner, to increasingly 
higher concentrations of herbicidal protox inhibitor. In particular, the following steps are 
performed. 

Cell colonies emerging from plated cells of Example 11 are transferred to liquid 
MX1 medium containing the same concentration of protox inhibitor as used in the 
selection according to Example 11 in order to form suspension cultures. Alternatively, 
selected cell suspension cultures of Example 12 are subcultured in liquid MX1 medium 
containing the same concentration of protox inhibitor as used for selection according to 
Example 12. 

Cultures are subcultured 1-20 times at weekly intervals and are then subcultured 
into MX1 medium containing the next higher herbicide concentration. The cells are 
cultured for 1-10 subcultures in medium containing this higher concentration of 
herbicide. The cells are then transferred to MX1 medium containing the next higher 
concentration of herbicide. 

Alternatively, pieces of selected callus of Example 11 are transferred to solidified 
MX1 medium supplemented with the desired herbicide concentration. Transfer to 
higher herbicide concentrations follows the procedure outlined in the preceding 
paragraph except that solidified medium is used. 

EXAMPLE 14: Measuring herbicide dose-dependent growth of cells in 
suspension cultures 

In order to obtain a dose-response curve the growth of cells at different 
concentrations of herbicide is determined. Suspension culture cells of herbicidal protox 
inhibitor sensitive wild-type tobacco cells S3 and herbicide tolerant selected or 
transgenic cells are pregrown in 
liquid medium as in Example 11 at a high cell density for 2-4 days. The cells are 
washed free of spent medium and fresh medium without herbicide is added to give the 
desired cell density (about 150 mg FW cells per ml of suspension). A sample of 2.5 ml 
of cell suspension, containing approx. 250-300 mg FW cells, is then inoculated into 
approx. 30 ml of liquid medium of desired herbicide concentration contained in a 100 ml 
Erlenmeyer flask. Care is taken to inoculate the same amount of cells into each flask. 
Each flask contains an equal volume of medium. 3-6 replicate flasks are inoculated per 
herbicide concentration. The herbicide concentration is selected from zero (control), 
0.1 ppb, 0.3 ppb, 1 ppb, 3 ppb, 10 ppb, 30 ppb, 100 ppb, 300 ppb, 1000 ppb, 3000 
ppb, and 10,000 ppb. Several samples of inoculated cells are also taken at the time of 
inoculation to determine the mass of cells inoculated per flask. 

Cells are then incubated for growth under controlled conditions at 28 C in the dark for 
10 days. The cells are harvested by pouring the contents of each flask onto a filter 
paper disk attached to a vacuum suction device to remove all liquid and to obtain a 
mass of reasonably dry fresh cells. The fresh mass of cells is weighed. The dry weight 
of samples may be obtained after drying. 

Cell growth is determined as the gain in cell mass within 10 days and expressed 
as a percentage relative to cells grown in the absence of herbicide according to the 
formula: (final mass of herbicide-grown cells minus inoculum mass) x 100 divided by 
(final mass of cells grown without herbicide minus inoculum mass). IC50 values are 
determined from graphs of plotted data (relative cell mass vs. herbicide concentration). 
IC50 denotes the herbicide concentration at which cell growth is 50% of control growth 
(cells grown in the absence of herbicide). 
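The relative growth formula and the reading of an IC50 value from the dose-response data can be sketched as follows. This is a minimal illustration and not the procedure of the example itself, which reads IC50 values from plotted graphs; the linear interpolation on a log10 concentration axis, the function names and the numbers in the usage lines are assumptions made here.

```python
import math


def relative_growth(final_mass, control_final_mass, inoculum_mass):
    """Cell gain as a percentage of the gain of the herbicide-free control."""
    control_gain = control_final_mass - inoculum_mass
    if control_gain <= 0:
        raise ValueError("control culture shows no growth")
    return (final_mass - inoculum_mass) * 100.0 / control_gain


def estimate_ic50(concentrations_ppb, growth_percent):
    """Estimate the concentration giving 50% relative growth.

    Assumes concentrations are positive and sorted in ascending order.
    Interpolates linearly on a log10 concentration axis between the two
    neighbouring points that bracket 50%. Returns None when the data
    never cross 50%.
    """
    points = list(zip(concentrations_ppb, growth_percent))
    for (c_lo, g_lo), (c_hi, g_hi) in zip(points, points[1:]):
        if g_lo >= 50.0 >= g_hi and g_lo != g_hi:
            fraction = (g_lo - 50.0) / (g_lo - g_hi)
            log_ic50 = math.log10(c_lo) + fraction * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_ic50
    return None


if __name__ == "__main__":
    # Hypothetical masses in grams fresh weight, not measured data
    print(relative_growth(final_mass=3.0, control_final_mass=5.3, inoculum_mass=0.3))
    print(estimate_ic50([10, 100], [75.0, 25.0]))
```

The zero-herbicide control is used only as the denominator of the relative growth formula; it is left out of the interpolation because a concentration of zero has no logarithm.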

In a modification of the method several pieces of callus derived from a herbicide 
resistant cell culture, as obtained in Examples 11 and 13, are transferred to solidified 
callus culture medium containing the different herbicide concentrations. Relative 
growth is determined after a culture period of 2-6 weeks by weighing callus pieces and 
comparing to a control culture grown in medium without herbicide. However, the 
suspension method is preferred for its greater accuracy. 

EXAMPLE 15: Determination of cross tolerance 

In order to determine the extent to which cells show tolerance to analogous or other 
herbicides, Example 14 is repeated by growing cells in increasing concentrations of 
chosen herbicides. The relative growth of the cells and their IC50 values are determined 
for each herbicide for comparison. 

EXAMPLE 16: Determining the stability of the herbicide tolerance phenotype 
over time 

In order to determine whether the herbicide tolerance phenotype of a cell culture is 
maintained over time, cells are transferred from herbicide containing medium to 
medium without herbicide. Cells are grown, as described in Example 10, in the 
absence of herbicide for a period of 3 months, employing regular subculturing at 
suitable intervals (7-10 days for suspension cultures; 3-6 weeks for callus cultures). A 
known quantity of cells is then transferred back to herbicide containing medium and 
cultured for 10 days (suspension cultures) or 4 weeks (callus cultures). Relative growth 
is determined as in Example 14. 

EXAMPLE 17: Induction and culture of embryogenic callus from corn scutellum 
tissue 

Ears are harvested from self pollinated corn plants of the inbred line Funk 2717 12-14 
days post pollination. Husks are removed and the ears are sterilized for about 15 
minutes by shaking in a 20% solution of commercial Clorox bleach with some drops of 
detergent added for better wetting. Ears are then rinsed several times with sterile 
water. All further steps are performed aseptically in a sterile air flow hood. Embryos of 
1.5-2.5 mm length are removed from the kernels with a spatula and placed, embryo 
axis downwards, onto MS culture medium containing 2 mg/l 2,4-dichlorophenoxyacetic 
acid (2,4-D) and 3% sucrose, solidified with 0.24% Gelrite®. 

Embryogenic callus forms on the scutellum tissue of the embryos within 2-4 weeks 
of culture at about 28 C in the dark. The callus is removed from the explant and 
transferred to fresh solidified MS medium containing 2 mg/l 2,4-D. The subculture of 
embryogenic callus is repeated at weekly intervals. Only callus portions having an 
embryogenic morphology are subcultured. 



EXAMPLE 18: Selection of corn cell cultures tolerant to herbicidal protox 
inhibitors 

a) Selection using embryogenic callus: Embryogenic callus of Example 17 is 
transferred to callus maintenance medium consisting of N6 medium containing 2 mg/l 
2,4-D, 3% sucrose and protox inhibitor at a concentration sufficient to retard growth, but 
that does not affect the embryogenicity of the culture, and solidified with 0.24% 
Gelrite®. To increase the frequency of herbicide tolerant mutations, cultures can be 
pretreated before selection with a chemical mutagen, e.g. ethylmethane sulfonate, or a 
physical mutagen, e.g. UV light, at a concentration just below the concentration at 
which growth inhibition is detected, as determined in Example 14. Cultures are 
incubated at 28 C in the dark. After 14 days growing callus is transferred to fresh 
medium of the same composition. Only cultures with the desired embryogenic 
morphology known as friable embryogenic callus of type II morphology are subcultured. 
Cultures are propagated by subculturing at weekly intervals for two to ten subcultures 
on fresh medium whereby only the fastest growing cultures are subcultured. The fast 
growing callus is then transferred to callus maintenance medium containing a protox 
inhibiting herbicide at a suitable concentration as defined in Example 11. When callus 
grows well on this herbicide concentration, usually after about five to ten weekly 
subcultures, the callus is transferred to callus maintenance medium containing a three- 
fold higher concentration of inhibitor, and subcultured until a well growing culture is 
obtained. This process is repeated using medium containing protox inhibitor at a 
concentration 10-fold higher than the original suitable concentration, and again with 
medium containing 20-fold and 40-fold higher concentrations. 

When sufficient callus has been produced it is transferred to regeneration medium 
suitable for embryo maturation and plant regeneration. Embryogenic callus growing on 
each of the herbicide concentrations used is transferred to regeneration medium. 

b) Selection using embryogenic suspension cultures: Embryogenic suspension 
cultures of corn Funk inbred line 2717 are established according to Example 24 and 
maintained by subculturing at weekly intervals to fresh liquid N6 medium containing 2 
mg/l 2,4-D. To increase the frequency of herbicide tolerant mutations, cultures can be 
treated at this time with a chemical mutagen, e.g. ethylmethane sulfonate, at a 
concentration just below the concentration at which growth inhibition is detected, as 
determined in Example 14. For selection, the cultures are transferred to liquid N6 
medium containing 2 mg/l 2,4-D and a concentration of inhibitor sufficient to retard 
growth, but that does not affect the embryogenicity of the culture. Cultures are grown 
on a shaker at 120 rpm at 28 C in the dark. At weekly intervals, the medium is 
removed and fresh medium added. The cultures are diluted with culture medium in 
accord with their growth to maintain about 10 ml of packed cell volume per 50 ml of 
medium. At each subculture, cultures are inspected and only fast growing cultures with 
two to ten subcultures in N6 medium containing, cultures are increasing in growth rate 
at least two- to threefold per weekly subculture. The cultures are then transferred to N6 
medium containing 2 mg/l 2,4-D and a three-fold higher dose of inhibitor than originally 
used. Growing cultures are repeatedly subcultured in this medium for another two to 
ten subcultures as described above. Fast growing cultures with the desired friable 
embryogenic morphology are selected for further subculture. Fast growing cultures are 
then transferred to N6 medium containing 2 mg/l 2,4-D and a ten-fold higher 
concentration of inhibitor than originally used, and the process of subculturing growing 
cultures with the desired friable embryogenic morphology is repeated for two to ten 
subcultures until fast growing cultures are obtained. These cultures are then 
transferred to N6 medium containing 2 mg/l 2,4-D and a 30-fold higher concentration 
of inhibitor than originally used. 

For regeneration of plants from each embryogenic suspension culture selected with 
the mentioned herbicide concentration level, the cultures are first transferred onto N6 
medium solidified with 0.24% Gelrite® and containing 2 mg/l 2,4-D and, optionally, the 
concentration of inhibitor in which the cultures have been growing, to produce 
embryogenic callus. The embryogenic callus is subcultured onto fresh callus 
maintenance medium until a sufficient amount of callus is obtained for regeneration. 
Only cultures with the desired embryogenic morphology are subcultured. 

EXAMPLE 19: Regeneration of corn plants from selected callus or suspension 
culture 

Plants are regenerated from the selected embryogenic callus cultures of Example 13 
by transferring to fresh regeneration medium. Regeneration media used are: 0N6 
medium consisting of N6 medium lacking 2,4-D, or N61 consisting of N6 medium 
containing 0.25 mg/l 2,4-D and 10 mg/l kinetin (6-furfurylaminopurine), or N62 
consisting of N6 medium containing 0.1 mg/l 2,4-D and 1 mg/l kinetin, all solidified 
with 0.24% Gelrite®. Cultures are grown at 28 C in the light (16 h per day of 10-100 
µEinsteins/m² sec from white fluorescent lamps). The cultures are subcultured every 
two weeks onto fresh medium. Plantlets develop within 3 to 8 weeks. Plantlets at least 
2 cm tall are removed from adhering callus and transferred to root promoting medium. 
Different root promoting media are used. The media consist of N6 or MS medium 
lacking vitamins with either the usual amount of salts or with salts reduced to one half, 
sucrose reduced to 1 g/l, and further either lacking growth regulating compounds or 
containing 0.1 mg/l α-naphthaleneacetic acid. Once roots are sufficiently developed, 
plantlets are transplanted to a potting mixture consisting of vermiculite, peat moss and 
garden soil. At transplanting all remaining callus is trimmed away, all agar is rinsed off 
and the leaves are clipped about half. Plantlets are grown in the greenhouse initially 
covered for some days with an inverted clear plastic cup to retain humidity and grown 
with shading. After acclimatization plants are repotted and grown to maturity. Fertilizer 
Peters 20-20-20 [Grace Sierra] is used to ensure healthy plant development. Upon 
flowering, plants are pollinated, preferably self pollinated. 

EXAMPLE 20: Construction of Plant Transformation Vectors 

Numerous transformation vectors are available for plant transformation, and the 
genes of this invention can be used in conjunction with any such vectors. The selection 
of vector for use will depend upon the preferred transformation technique and the 
target species for transformation. For certain target species, different antibiotic or 
herbicide selection markers may be preferred. Selection markers used routinely in 
transformation include the nptII gene which confers resistance to kanamycin and 
related antibiotics (Messing & Vierra, Gene 19: 259-268 (1982); Bevan et al., Nature 
304: 184-187 (1983)), the bar gene which confers resistance to the herbicide 
phosphinothricin (White et al., Nucl Acids Res 18: 1062 (1990), Spencer et al., Theor 
Appl Genet 79: 625-631 (1990)), the hph gene which confers resistance to the antibiotic 
hygromycin (Blochinger & Diggelmann, Mol Cell Biol 4: 2929-2931), and the dhfr gene, 
which confers resistance to methotrexate (Bourouis et al., EMBO J. 2(7): 1099-1104 
(1983)). 

(1) Construction of Vectors Suitable for Agrobacterium Transformation 

Many vectors are available for transformation using Agrobacterium tumefaciens. These 
typically carry at least one T-DNA border sequence and include vectors such as pBIN19 
(Bevan, Nucl. Acids Res. (1984)). Below the construction of two typical vectors is 
described. 

Construction of pCIB200 and pCIB2001 

The binary vectors pCIB200 and pCIB2001 are used for the construction of 
recombinant vectors for use with Agrobacterium and were constructed in the following 
manner. pTJS75kan was created by NarI digestion of pTJS75 (Schmidhauser & 
Helinski, J Bacteriol. 164: 446-455 (1985)) allowing excision of the 
tetracycline-resistance gene, followed by insertion of an AccI fragment from pUC4K carrying an 
NPTII (Messing & Vierra, Gene 19: 259-268 (1982); Bevan et al., Nature 304: 184-187 
(1983); McBride et al., Plant Molecular Biology 14: 266-276 (1990)). XhoI linkers were 
ligated to the EcoRV fragment of pCIB7 which contains the left and right T-DNA 
borders, a plant selectable nos/nptII chimeric gene and the pUC polylinker (Rothstein et 
al., Gene 53: 153-161 (1987)), and the XhoI-digested fragment was cloned into 
SalI-digested pTJS75kan to create pCIB200 (see also EP 0 332 104, example 19 [1338]). 
pCIB200 contains the following unique polylinker restriction sites: EcoRI, SstI, KpnI, 
BglII, XbaI, and SalI. pCIB2001 is a derivative of pCIB200 which was created by the 
insertion into the polylinker of additional restriction sites. Unique restriction sites in the 
polylinker of pCIB2001 are EcoRI, SstI, KpnI, BglII, XbaI, SalI, MluI, BclI, AvrII, ApaI, 
HpaI, and StuI. pCIB2001, in addition to containing these unique restriction sites, also 
has plant and bacterial kanamycin selection, left and right T-DNA borders for 
Agrobacterium-mediated transformation, the RK2-derived trfA function for mobilization 
between E. coli and other hosts, and the OriT and OriV functions also from RK2. The 
pCIB2001 polylinker is suitable for the cloning of plant expression cassettes containing 
their own regulatory signals. 
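The relationship between the two polylinkers can be written down as a small lookup, which is convenient when deciding which vector can accept a given expression cassette. The sketch below is illustrative only: the site lists are transcribed from the paragraph above, the function name is hypothetical, and the check considers only whether the named sites are present as unique polylinker sites, not whether cohesive ends are compatible or whether the sites also occur inside the cassette.

```python
# Unique polylinker sites as listed in the text for each binary vector
POLYLINKER_SITES = {
    "pCIB200": {"EcoRI", "SstI", "KpnI", "BglII", "XbaI", "SalI"},
    "pCIB2001": {
        "EcoRI", "SstI", "KpnI", "BglII", "XbaI", "SalI",
        "MluI", "BclI", "AvrII", "ApaI", "HpaI", "StuI",
    },
}


def vectors_offering(required_sites):
    """Return the vectors whose polylinker contains every required site."""
    required = set(required_sites)
    return sorted(
        name for name, sites in POLYLINKER_SITES.items() if required <= sites
    )


if __name__ == "__main__":
    print(vectors_offering(["EcoRI", "SalI"]))
    print(vectors_offering(["KpnI", "StuI"]))
```

Because the pCIB2001 polylinker is a superset of the pCIB200 polylinker, any cassette that fits pCIB200 by this criterion also fits pCIB2001.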

Construction of pCIB10 and Hygromycin Selection Derivatives thereof 

The binary vector pCIB10 contains a gene encoding kanamycin resistance for 
selection in plants, T-DNA right and left border sequences and incorporates sequences 
from the wide host-range plasmid pRK252 allowing it to replicate in both E. coli and 
Agrobacterium. Its construction is described by Rothstein et al., Gene 53: 153-161 
(1987). Various derivatives of pCIB10 have been constructed which incorporate the 
gene for hygromycin B phosphotransferase described by Gritz et al., Gene 25: 179-188 
(1983). These derivatives enable selection of transgenic plant cells on hygromycin 
only (pCIB743), or hygromycin and kanamycin (pCIB715, pCIB717) [Rothstein et al., 
Gene 53: 153-161 (1987)]. 

(2) Construction of Vectors Suitable for non-Agrobacterium Transformation. 

Transformation without the use of Agrobacterium tumefaciens circumvents the 
requirement for T-DNA sequences in the chosen transformation vector and 
consequently vectors lacking these sequences can be utilized in addition to vectors 
such as the ones described above which contain T-DNA sequences. Transformation 
techniques which do not rely on Agrobacterium include transformation via particle 
bombardment, protoplast uptake (e.g. PEG and electroporation) and microinjection. 
The choice of vector depends largely on the preferred selection for the species being 
transformed. Below, the construction of some typical vectors is described. 

Construction of pCIB3064 

pCIB3064 is a pUC-derived vector suitable for direct gene transfer techniques in 
combination with selection by the herbicide basta (or phosphinothricin). The plasmid 
pCIB246 comprises the CaMV 35S promoter in operational fusion to the E. coli GUS 
gene and the CaMV 35S transcriptional terminator and is described in the PCT 
published application WO 93/07278. The 35S promoter of this vector contains two 
ATG sequences 5' of the start site. These sites were mutated using standard PCR 
techniques in such a way as to remove the ATGs and generate the restriction sites 
SspI and PvuII. The new restriction sites were 96 and 37 bp away from the unique SalI 
site and 101 and 42 bp away from the actual start site. The resultant derivative of 
pCIB246 was designated pCIB3025. The GUS gene was then excised from pCIB3025 
by digestion with SalI and SacI, the termini rendered blunt and religated to generate 
plasmid pCIB3060. The plasmid pJIT82 was obtained from the John Innes Centre, 
Norwich and a 400 bp SmaI fragment containing the bar gene from Streptomyces 
viridochromogenes was excised and inserted into the HpaI site of pCIB3060 
(Thompson et al., EMBO J 6: 2519-2523 (1987)). This generated pCIB3064 which 
comprises the bar gene under the control of the CaMV 35S promoter and terminator for 
herbicide selection, a gene for ampicillin resistance (for selection in E. coli) and a 
polylinker with the unique sites SphI, PstI, HindIII, and BamHI. This vector is suitable 
for the cloning of plant expression cassettes containing their own regulatory signals. 

Construction of pSOG19 and pSOG35 

pSOG35 is a transformation vector which utilizes the E. coli gene dihydrofolate 
reductase (DHFR) as a selectable marker conferring resistance to methotrexate. PCR 
was used to amplify the 35S promoter (~800 bp), intron 6 from the maize Adh1 gene 
(~550 bp) [Lou et al., Plant J 3: 393-403, 1993; Dennis et al., Nucl Acids Res 12: 
3983-4000, 1984] and 18 bp of the GUS untranslated leader sequence from pSOG10. A 250 
bp fragment encoding the E. coli dihydrofolate reductase type II gene was also 
amplified by PCR and these two PCR fragments were assembled with a SacI-PstI 
fragment from pBI221 (Clontech) which comprised the pUC19 vector backbone and the 
nopaline synthase terminator. Assembly of these fragments generated pSOG19 which 
contains the 35S promoter in fusion with the intron 6 sequence, the GUS leader, the 
DHFR gene and the nopaline synthase terminator. Replacement of the GUS leader in 
pSOG19 with the leader sequence from Maize Chlorotic Mottle Virus (MCMV) 
generated the vector pSOG35. pSOG19 and pSOG35 carry the pUC gene for 
ampicillin resistance and have HindIII, SphI, PstI and EcoRI sites available for the 
cloning of foreign sequences. 

pSOG10 

This β-Glucuronidase (GUS) expression vector was derived from plasmid pBI121, 
purchased from Clontech Laboratories, Palo Alto, California. Intron 6 of the maize 
Adh1 gene was amplified by PCR from plasmid pB428, described in Bennetzen et al., 
Proc. Natl. Acad. Sci. USA 81: 4125-4128 (1987), using oligonucleotide primers 
SON0003 and SON0004. 

SON0003: 5'-CTCGGATCCAGCAGATTCGAAGAAGGTACAG-3' 
SON0004: 5'-ACGGGATCCAACTTCCTAGCTGAAAAATGGG-3' 

The PCR reaction product was digested with restriction endonuclease BamHI, 
cleaving the BamHI site added on the 5' end of each PCR primer. The resulting DNA 
fragment was purified on an agarose gel and ligated into the BamHI site of pBI121, 
which is between the CaMV35S promoter and the GUS gene. The ligated DNA was 
transformed into E.coli and clones with the Adh1 intron 6 in the same orientation as the 
GUS gene were identified by restriction digest. 

pSOG19 

This dihydrofolate reductase (DHFR) expression vector was derived by fusing the 
35S promoter and Adh1 intron 6 of pSOG10 to the DHFR gene from plasmid pHCO, 
described in Bourouis and Jarry, EMBO J. 2: 1099-1104 (1983). The 35S promoter and 
Adh1 intron 6 were produced by PCR amplification of the fragment from pSOG10 using 
primers SON0031 and SON0010. 

SON0031: 5'-CATGAGGGACTGACCACCCGGGGATC-3' 
SON0010: 5'-AGCGGATAACAATTTCACACAGGA-3' 

The resulting fragment was digested with restriction endonucleases PstI and BspHI 
and purified on an agarose gel. 

The DHFR coding region was produced by PCR amplification of pHCO using 
primers SON0016 and SON0017. 



SON0016: 5'-GCTACCATGGCCACATAGAACACC-3' 
SON0017: 5'-CGAGAGCTCGCACTTCAACCTTG-3' 

The resulting fragment was digested with restriction endonucleases NcoI and SacI 
and purified on an agarose gel. 
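The primer-encoded restriction sites used for pSOG10 and for the DHFR coding region can be checked by a simple string search. The sketch below is illustrative only: the primer sequences are transcribed from the listings above, the recognition sequences (BamHI GGATCC, NcoI CCATGG, SacI GAGCTC) are the standard ones for these enzymes, and the function and dictionary names are hypothetical.

```python
# Standard recognition sequences of the enzymes named in the text
RECOGNITION_SITES = {"BamHI": "GGATCC", "NcoI": "CCATGG", "SacI": "GAGCTC"}

# Primer sequences (5' to 3') paired with the site each is described as carrying
PRIMERS = {
    "SON0003": ("CTCGGATCCAGCAGATTCGAAGAAGGTACAG", "BamHI"),
    "SON0004": ("ACGGGATCCAACTTCCTAGCTGAAAAATGGG", "BamHI"),
    "SON0016": ("GCTACCATGGCCACATAGAACACC", "NcoI"),
    "SON0017": ("CGAGAGCTCGCACTTCAACCTTG", "SacI"),
}


def site_offset(primer, enzyme):
    """0-based offset of the enzyme's recognition site in the primer, or -1 if absent."""
    return primer.upper().find(RECOGNITION_SITES[enzyme])


def check_primers():
    """Map each primer name to the offset of its expected restriction site."""
    return {name: site_offset(seq, enzyme) for name, (seq, enzyme) in PRIMERS.items()}


if __name__ == "__main__":
    for name, offset in check_primers().items():
        print(name, offset)
```

In each primer the site sits a few bases in from the 5' end. The extra 5' bases are what allow the enzyme to cut close to the end of the PCR product.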

The two fragments described above were ligated into a vector fragment prepared 
from pBI121 by digestion with restriction endonucleases PstI and SacI and purification 
on an agarose gel. This three way ligation fused the 35S promoter-Adh1 intron 6-DHFR 
gene-Nos terminator in correct order and orientation for functional expression in plants. 

pSOG30 

This GUS expression vector was derived from pSOG10 by the insertion of the 
maize chlorotic mottle virus (MCMV) leader, described in Lommel et al., Virology 181: 
382-385 (1991), into the 35S-GUS gene non-translated leader by a three way ligation. 

Both strands of the 17 bp MCMV capsid protein leader sequence plus appropriate 
restriction endonuclease sites were synthesized and annealed. The resulting double 
stranded fragment was digested with BamHI and NcoI and purified on an acrylamide 
gel. 

The GUS gene coding region was amplified by PCR using primers SON0039 and 
SON0041 and pBI121 as a template. 

SON0039: 5'-CGACATGGTACGTCCTGTAGAAACCCACA-3' 
SON0041: 5'-ATCGCAAGACCGGCAACAGGATTC-3' 

These primers added an NcoI site to the 5' end of GUS and a SacI site to the 3' 
end of GUS. The resulting fragment was digested with restriction endonucleases NcoI 
and SacI and purified on an agarose gel. 

The GUS gene was removed from the plasmid pSOG10 by digestion with 
restriction endonuclease SacI and partial digestion with restriction endonuclease 
BamHI. The resulting vector, which has a BamHI site and a SacI site in which to reinsert 
a coding region behind the 35S promoter-Adh1 intron 6, was purified on an agarose 
gel. 

The three fragments described above were ligated in a three way ligation to 
produce a gene fusion with the structure: 35S promoter-Adh1 intron 6-MCMV 
leader-GUS-Nos terminator, all in the pUC19 vector backbone. 



pSOG35 

The DHFR selectable marker vector is identical to pSOG19, except that the MCMV 
leader is inserted in the non-translated leader of the DHFR gene to enhance 
translation. It was created in two steps. First the GUS coding region in pSOG32, a 
vector identical to pSOG30 except that it contains a modified Adh promoter rather than 
35S, was replaced with the DHFR coding region from pSOG19 by excising the GUS with 
NcoI and SacI and ligating in the DHFR as an NcoI-SacI fragment. This resulted in 
vector pSOG33 which has the gene structure Adh promoter-Adh1 intron 6-MCMV 
leader-DHFR coding region-Nos terminator, with a BglII site between the promoter and 
intron and a SacI site between the coding region and the terminator. The BglII-SacI 
fragment was isolated by restriction endonuclease digestion and agarose gel 
purification, and ligated into the BamHI and SacI sites of pSOG30, replacing the Adh1 
intron 6-MCMV leader-GUS coding region of pSOG30 with the Adh1 intron 6-MCMV 
leader-DHFR coding region of pSOG33. 
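The fragment exchange that yields pSOG35 can be followed by treating each cassette as an ordered list of elements. The sketch below is a bookkeeping aid only, not a model of the cloning chemistry; the element names are taken from the gene structures given in the text, while the list representation and the function name are assumptions made here.

```python
# Gene structures as stated in the text, written 5' to 3'
PSOG30 = ["35S promoter", "Adh1 intron 6", "MCMV leader",
          "GUS coding region", "Nos terminator"]
PSOG33 = ["Adh promoter", "Adh1 intron 6", "MCMV leader",
          "DHFR coding region", "Nos terminator"]


def replace_fragment(recipient, donor_fragment, start, stop):
    """Return a copy of recipient with elements start..stop-1 replaced by donor_fragment."""
    return recipient[:start] + list(donor_fragment) + recipient[stop:]


# The BglII-SacI fragment of pSOG33 spans intron through coding region
# (indices 1 to 3), the same span bounded by BamHI and SacI in pSOG30.
donor = PSOG33[1:4]
PSOG35 = replace_fragment(PSOG30, donor, 1, 4)

if __name__ == "__main__":
    print("-".join(PSOG35))
```

The BglII end of the donor fragment can be ligated into the BamHI site of the recipient because both enzymes leave the same GATC overhang.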

EXAMPLE 21 : Construction of Plant Expression Cassettes 

Gene sequences intended for expression in transgenic plants are firstly assembled 
in expression cassettes behind a suitable promoter and upstream of a suitable 
transcription terminator. These expression cassettes can then be easily transferred to 
the plant transformation vectors described above in Example 20. 

Promoter Selection 

The selection of a promoter used in expression cassettes will determine the spatial 
and temporal expression pattern of the transgene in the transgenic plant. Selected 
promoters will express transgenes in specific cell types (such as leaf epidermal cells, 
mesophyll cells, root cortex cells) or in specific tissues or organs (roots, leaves or 
flowers, for example) and this selection will reflect the desired location of expression of 
the transgene. Alternatively, the selected promoter may drive expression of the gene 
under a light-induced or other temporally regulated promoter. A further alternative is 
that the selected promoter be chemically regulated. This would provide the possibility 
of inducing expression of the transgene only when desired and caused by treatment 
with a chemical inducer. 

Transcriptional Terminators 

A variety of transcriptional terminators are available for use in expression cassettes. 
These are responsible for the termination of transcription beyond the transgene and its 
correct polyadenylation. Appropriate transcriptional terminators are those which are 
known to function in plants and include the CaMV 35S terminator, the tml terminator, 
the nopaline synthase terminator, and the pea rbcS E9 terminator. These can be used in 
both monocotyledons and dicotyledons. 

Sequences for the Enhancement or Regulation of Expression 

Numerous sequences have been found to enhance gene expression from within the 
transcriptional unit and these sequences can be used in conjunction with the genes of 
this invention to increase their expression in transgenic plants. 

Various intron sequences have been shown to enhance expression, particularly in 
monocotyledonous cells. For example, the introns of the maize Adh1 gene have been 
found to significantly enhance the expression of the wild-type gene under its cognate 
promoter when introduced into maize cells. Intron 1 was found to be particularly 
effective and enhanced expression in fusion constructs with the chloramphenicol 
acetyltransferase gene (Callis et al., Genes Develop. 1: 1183-1200 (1987)). In the 
same experimental system, the intron from the maize bronze 1 gene had a similar effect 
in enhancing expression (Callis et al., supra). Intron sequences have been routinely 
incorporated into plant transformation vectors, typically within the non-translated leader. 

A number of non-translated leader sequences derived from viruses are also known 
to enhance expression, and these are particularly effective in dicotyledonous cells. 
Specifically, leader sequences from Tobacco Mosaic Virus (TMV, the "Ω-sequence"), 
Maize Chlorotic Mottle Virus (MCMV), and Alfalfa Mosaic Virus (AMV) have been shown 
to be effective in enhancing expression (e.g. Gallie et al., Nucl. Acids Res. 15: 
8693-8711 (1987); Skuzeski et al., Plant Molec. Biol. 15: 65-79 (1990)). 

Targeting of the Gene Product Within the Cell 

Various mechanisms for targeting gene products are known to exist in plants and 
the sequences controlling the functioning of these mechanisms have been 
characterized in some detail. For example, the targeting of gene products to the 
chloroplast is controlled by a signal sequence found at the amino terminal end of 
various proteins and which is cleaved during chloroplast import yielding the mature 
protein (e.g. Comai et al., J. Biol. Chem. 263: 15104-15109 (1988)). These signal 
sequences can be fused to heterologous gene products to effect the import of 
heterologous products into the chloroplast (van den Broeck et al., Nature 313: 358-363 
(1985)). DNA encoding for appropriate signal sequences can be isolated from the 5' 
end of the cDNAs encoding the RUBISCO protein, the CAB protein, the EPSP 
synthase enzyme, the GS2 protein and many other proteins which are known to be 
chloroplast localized. 

Other gene products are localized to other organelles such as the mitochondrion 
and the peroxisome (e.g. Unger et al., Plant Molec. Biol. 13: 411-418 (1989)). The 
cDNAs encoding these products can also be manipulated to effect the targeting of 
heterologous gene products to these organelles. Examples of such sequences are the 
nuclear-encoded ATPases and specific aspartate amino transferase isoforms for 
mitochondria. Targeting to cellular protein bodies has been described by Rogers et al., 
Proc. Natl. Acad. Sci. USA 82: 6512-6516 (1985). 

In addition sequences have been characterized which cause the targeting of gene 
products to other cell compartments. Amino terminal sequences are responsible for 
targeting to the ER, the apoplast, and extracellular secretion from aleurone cells 
(Koehler & Ho, Plant Cell 2: 769-783 (1990)). Additionally, amino terminal sequences 
in conjunction with carboxy terminal sequences are responsible for vacuolar targeting 
of gene products (Shinshi et al., Plant Molec. Biol. 14: 357-368 (1990)). 

By the fusion of the appropriate targeting sequences described above to transgene 
sequences of interest it is possible to direct the transgene product to any organelle or 
cell compartment. For chloroplast targeting, for example, the chloroplast signal 
sequence from the RUBISCO gene, the CAB gene, the EPSP synthase gene, or the 
GS2 gene is fused in frame to the amino terminal ATG of the transgene. The signal 
sequence selected should include the known cleavage site and the fusion constructed
should take into account any amino acids after the cleavage site which are required for
cleavage. In some cases this requirement may be fulfilled by the addition of a small
number of amino acids between the cleavage site and the transgene ATG or,
alternatively, replacement of some amino acids within the transgene sequence. Fusions
constructed for chloroplast import can be tested for efficacy of chloroplast uptake by in
vitro translation of in vitro transcribed constructions followed by in vitro chloroplast
uptake using techniques described by Bartlett et al. (In: Edelmann et al. (Eds.) Methods
in Chloroplast Molecular Biology, Elsevier, pp. 1081-1091 (1982)) and Wasmann et al.
(Mol. Gen. Genet. 205: 446-453 (1986)). These construction techniques are well known in
the art and are equally applicable to mitochondria and peroxisomes. The choice of
targeting which may be required for expression of the transgenes will depend on the
cellular localization of the precursor required as the starting point for a given pathway.
This will usually be cytosolic or chloroplastic, although it may in some cases be
mitochondrial or peroxisomal. The products of transgene expression will not normally
require targeting to the ER, the apoplast or the vacuole. 
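
As a rough illustration of the in-frame requirement described above, the following Python sketch joins a signal-sequence coding region to a transgene coding region and checks that the reading frame is preserved. The toy sequences, the optional linker and the function name are hypothetical placeholders and are not sequences from this specification.

```python
def fuse_in_frame(signal_cds: str, transgene_cds: str, linker: str = "") -> str:
    """Join a signal-sequence coding region to a transgene coding region.

    signal_cds    -- coding sequence of the signal peptide, from its ATG
                     through the cleavage site (hypothetical input)
    transgene_cds -- transgene coding sequence starting at its own ATG
    linker        -- optional codons for extra amino acids placed between
                     the cleavage site and the transgene ATG
    """
    signal_cds, transgene_cds, linker = (
        s.upper() for s in (signal_cds, transgene_cds, linker)
    )
    # Anything placed upstream of the transgene must be a whole number of codons.
    for name, seq in (("signal", signal_cds), ("linker", linker)):
        if len(seq) % 3 != 0:
            raise ValueError(f"{name} length {len(seq)} is not a multiple of 3")
    if not signal_cds.startswith("ATG") or not transgene_cds.startswith("ATG"):
        raise ValueError("both coding sequences are expected to start with ATG")
    # No stop codon may occur in frame upstream of the transgene.
    upstream = signal_cds + linker
    for i in range(0, len(upstream), 3):
        if upstream[i:i + 3] in {"TAA", "TAG", "TGA"}:
            raise ValueError(f"in-frame stop codon at position {i + 1}")
    return upstream + transgene_cds


toy_signal = "ATGGCTTCCTCT"    # hypothetical 4-codon stand-in for a signal sequence
toy_gene = "ATGGAGTTATCTTAA"   # hypothetical transgene
fusion = fuse_in_frame(toy_signal, toy_gene, linker="GGT")
print(fusion)
print(fusion.index(toy_gene) % 3 == 0)  # transgene ATG starts on a codon boundary
```

A check of this kind only confirms the reading frame; whether a given fusion is actually imported and cleaved still has to be tested experimentally, for example by the in vitro uptake assays cited above.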

The above described mechanisms for cellular targeting can be utilized not only in 
conjunction with their cognate promoters, but also in conjunction with heterologous 
promoters so as to effect a specific cell targeting goal under the transcriptional 
regulation of a promoter which has an expression pattern different to that of the 
promoter from which the targeting signal derives. 

EXAMPLE 22: Transformation of Dicotyledons 

Transformation techniques for dicotyledons are well known in the art and include 
Agrobacterium-based techniques and techniques which do not require Agrobacterium. 
Non-Agrobacterium techniques involve the uptake of exogenous genetic material
directly by protoplasts or cells. This can be accomplished by PEG or electroporation
mediated uptake, particle bombardment-mediated delivery, or microinjection. Examples
of these techniques are described by Paszkowski et al., EMBO J. 3: 2717-2722 (1984),
Potrykus et al., Mol. Gen. Genet. 199: 169-177 (1985), Reich et al., Biotechnology 4:
1001-1004 (1986), and Klein et al., Nature 327: 70-73 (1987). In each case the
transformed cells are regenerated to whole plants using standard techniques known in
the art.

Agrobacterium-mediated transformation is a preferred technique for transformation 
of dicotyledons because of its high efficiency of transformation and its broad utility with 
many different species. The many crop species which are routinely transformable by 
Agrobacterium include tobacco, tomato, sunflower, cotton, oilseed rape, potato, 
soybean, alfalfa and poplar (EP 0 317 511 (cotton), EP 0 249 432 (tomato, to 
Calgene), WO 87/07299 (Brassica, to Calgene), US 4,795,855 (poplar)). 
Agrobacterium transformation typically involves the transfer of the binary vector
carrying the foreign DNA of interest (e.g. pCIB200 or pCIB2001) to an appropriate
Agrobacterium strain, the choice of which may depend on the complement of vir genes
carried by the host Agrobacterium strain either on a co-resident Ti plasmid or
chromosomally (e.g. strain CIB542 for pCIB200 and pCIB2001 (Uknes et al. Plant Cell 5:
159-169 (1993))). The transfer of the recombinant binary vector to Agrobacterium is
accomplished by a triparental mating procedure using E. coli carrying the recombinant
binary vector and a helper E. coli strain which carries a plasmid such as pRK2013 and
which is able to mobilize the recombinant binary vector to the target Agrobacterium
strain. Alternatively, the recombinant binary vector can be transferred to Agrobacterium
by DNA transformation (Höfgen & Willmitzer, Nucl. Acids Res. 16: 9877 (1988)).

Transformation of the target plant species by recombinant Agrobacterium usually 
involves co-cultivation of the Agrobacterium with explants from the plant and follows 
protocols well known in the art. Transformed tissue is regenerated on selectable 
medium carrying the antibiotic or herbicide resistance marker present between the 
binary plasmid T-DNA borders. 

EXAMPLE 23: Transformation of Monocotyledons 

Transformation of most monocotyledon species has now also become routine. 
Preferred techniques include direct gene transfer into protoplasts using PEG or 
electroporation techniques, and particle bombardment into callus tissue. 
Transformations can be undertaken with a single DNA species or multiple DNA species
(i.e. co-transformation) and both these techniques are suitable for use with this
invention. Co-transformation may have the advantage of avoiding complex vector
construction and of generating transgenic plants with unlinked loci for the gene of
interest and the selectable marker, enabling the removal of the selectable marker in
subsequent generations, should this be regarded as desirable. However, a disadvantage
of the use of co-transformation is the less than 100% frequency with which separate
DNA species are integrated into the genome (Schocher et al. Biotechnology 4: 1093-1096
(1986)).
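
The remark about removing the selectable marker in later generations can be made concrete with a small calculation. The sketch below assumes, purely for illustration, that the gene of interest and the marker have each integrated as a single hemizygous copy at unlinked loci and that the primary transformant is self-pollinated. None of these assumptions comes from the text, and real integration patterns can differ.

```python
from fractions import Fraction
from itertools import product


def p_carrier_after_selfing() -> Fraction:
    """P(progeny carries at least one copy) when a hemizygous (T/-) plant is selfed."""
    alleles = ("T", "-")                          # transgene present / absent
    offspring = list(product(alleles, repeat=2))  # (egg, pollen) combinations
    carriers = [g for g in offspring if "T" in g]
    return Fraction(len(carriers), len(offspring))


p_gene = p_carrier_after_selfing()           # gene of interest present
p_no_marker = 1 - p_carrier_after_selfing()  # selectable marker absent
# Unlinked loci segregate independently, so the probabilities multiply.
p_marker_free = p_gene * p_no_marker
print(p_marker_free)  # 3/16
```

Under these assumptions roughly three in sixteen selfed progeny would carry the gene of interest without the marker.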

Patent Applications EP 0 292 435 (to Ciba-Geigy), EP 0 392 225 (to Ciba-Geigy)
and WO 93/07278 (to Ciba-Geigy) describe techniques for the preparation of callus
and protoplasts from an elite inbred line of maize, transformation of protoplasts using
PEG or electroporation, and the regeneration of maize plants from transformed
protoplasts. Gordon-Kamm et al. (Plant Cell 2: 603-618 (1990)) and Fromm et al.
(Biotechnology 8: 833-839 (1990)) have published techniques for transformation of an
A188-derived maize line using particle bombardment. Furthermore, application WO
93/07278 (to Ciba-Geigy) and Koziel et al. (Biotechnology 11: 194-200 (1993)) describe
techniques for the transformation of elite inbred lines of maize by particle
bombardment. This technique utilizes immature maize embryos of 1.5-2.5 mm length
excised from a maize ear 14-15 days after pollination and a PDS-1000He Biolistics
device for bombardment.

Transformation of rice can also be undertaken by direct gene transfer techniques
utilizing protoplasts or particle bombardment. Protoplast-mediated transformation has
been described for Japonica-types and Indica-types (Zhang et al., Plant Cell Rep. 7:
379-384 (1988); Shimamoto et al. Nature 338: 274-277 (1989); Datta et al.
Biotechnology 8: 736-740 (1990)). Both types are also routinely transformable using
particle bombardment (Christou et al. Biotechnology 9: 957-962 (1991)).

Patent Application EP 0 332 581 (to Ciba-Geigy) describes techniques for the
generation, transformation and regeneration of Pooideae protoplasts. These
techniques allow the transformation of Dactylis and wheat. Furthermore, wheat
transformation has been described by Vasil et al. (Biotechnology 10: 667-674 (1992))
using particle bombardment into cells of type C long-term regenerable callus, and also
by Vasil et al. (Biotechnology 11: 1553-1558 (1993)) and Weeks et al. (Plant Physiol.
102: 1077-1084 (1993)) using particle bombardment of immature embryos and
immature embryo-derived callus. A preferred technique for wheat transformation,
however, involves the transformation of wheat by particle bombardment of immature
embryos and includes either a high sucrose or a high maltose step prior to gene
delivery. Prior to bombardment, any number of embryos (0.75-1 mm in length) are
plated onto MS medium with 3% sucrose (Murashige & Skoog, Physiologia Plantarum
15: 473-497 (1962)) and 3 mg/l 2,4-D for induction of somatic embryos, which is allowed
to proceed in the dark. On the chosen day of bombardment, embryos are removed
from the induction medium and placed onto the osmoticum (i.e. induction medium with
sucrose or maltose added at the desired concentration, typically 15%). The embryos
are allowed to plasmolyze for 2-3 h and are then bombarded. Twenty embryos per
target plate is typical, although not critical. An appropriate gene-carrying plasmid (such
as pCIB3064 or pSOG35) is precipitated onto micrometer size gold particles using
standard procedures. Each plate of embryos is shot with the DuPont Biolistics helium
device using a burst pressure of ~1000 psi and a standard 80 mesh screen. After
bombardment, the embryos are placed back into the dark to recover for about 24 h (still
on osmoticum). After 24 h, the embryos are removed from the osmoticum and placed
back onto induction medium where they stay for about a month before regeneration.
Approximately one month later the embryo explants with developing embryogenic
callus are transferred to regeneration medium (MS + 1 mg/liter NAA, 5 mg/liter GA),
further containing the appropriate selection agent (10 mg/l basta in the case of
pCIB3064 and 2 mg/l methotrexate in the case of pSOG35). After approximately one
month, developed shoots are transferred to larger sterile containers known as "GA7s"
which contain half-strength MS, 2% sucrose, and the same concentration of
selection agent. Patent application 08/147,161 describes methods for wheat
transformation and is hereby incorporated by reference.

EXAMPLE 24: Selecting for plant protox genes resistant to protox-inhibitory
herbicides in the E. coli expression system 

The plasmid pWDC-4, encoding the maize chloroplastic protox enzyme, is
transformed into the random mutagenesis strain XL1-Red (Stratagene, La Jolla, CA).
The transformation is plated on L media containing 50 µg/ml ampicillin and incubated for
48 hours at 37°C. Lawns of transformed cells are scraped from the plates and plasmid
DNA is prepared using the Wizard Megaprep kit (Promega, Madison, WI). Plasmid DNA
isolated from this mutator strain is predicted to contain approximately one random base
change per 2000 nucleotides (see Greener et al., Strategies 7(2): 32-34 (1994)).
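
For a sense of scale, the predicted rate of about one base change per 2000 nucleotides can be converted into an expected mutation load per coding sequence. The sketch below treats mutations as independent (Poisson) events, which is an assumption. It uses the 1614-nucleotide length of the Arabidopsis protox-1 coding region (CDS location 31..1644 in SEQ ID NO:1) only as a stand-in length, since pWDC-4 itself carries the maize sequence.

```python
import math

RATE_PER_NT = 1 / 2000  # from the text: about one random base change per 2000 nucleotides


def expected_mutations(length_nt: int) -> float:
    """Mean number of base changes in a stretch of the given length."""
    return length_nt * RATE_PER_NT


def prob_at_least_one(length_nt: int) -> float:
    """Poisson model (assumption): P(no change) = exp(-expected)."""
    return 1.0 - math.exp(-expected_mutations(length_nt))


cds_length = 1644 - 31 + 1  # stand-in coding-region length in nucleotides
print(cds_length, round(expected_mutations(cds_length), 3))
print(round(prob_at_least_one(cds_length), 3))
```

On this model a substantial fraction of plasmids would carry no change at all in the coding region, which is one reason a selection step on herbicide is needed to find the rare resistant clones.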

The mutated plasmid DNA is transformed into the hemG mutant SASX38 (Sasarman
et al., J. Gen. Microbiol. 113: 297 (1979)) and plated on L media containing 100 µg/ml
ampicillin and on the same media containing various concentrations of protox-inhibiting
herbicide. The plates are incubated for 2-3 days at 37°C. Plasmid DNA is isolated from
all colonies that grow in the presence of herbicide concentrations that effectively kill the
wild type strain. The isolated DNA is then transformed into SASX38 and plated again
on herbicide to ensure that the resistance is plasmid-borne.

Mutated pWDC-4 plasmid DNA is again isolated from resistant colonies and the
protox coding sequence is excised by digestion with EcoRI and XhoI. The excised
protox coding sequence is then recloned into an unmutagenized pBluescript vector and
retested for resistance to protox-inhibiting herbicide in the same manner as described
above.

This process eliminates non-coding sequence mutations which confer resistance 
such as up-promoter mutants (i.e. mutants whose resistance is due to mutations 
causing increased expression of unmodified protox) and leaves only mutants whose 
resistance is due to mutations in the protox coding sequence. The DNA sequence for 
all putative herbicide-tolerant protox genes identified through this process is determined 
and mutations are identified by comparison with the wild-type pWDC-4 protox 
sequence. 
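
The comparison step can be sketched as a simple position-by-position diff. The two short sequences below are invented for illustration and are not taken from pWDC-4. The function name is likewise hypothetical, and real sequencing reads would first need to be aligned.

```python
def point_mutations(wild_type: str, mutant: str):
    """Return (1-based position, wild-type base, mutant base) for each substitution."""
    if len(wild_type) != len(mutant):
        raise ValueError("sequences must be the same length; align them first")
    return [
        (pos, wt, mt)
        for pos, (wt, mt) in enumerate(zip(wild_type.upper(), mutant.upper()), start=1)
        if wt != mt
    ]


wt_toy = "ATGGCTGGTTAC"    # hypothetical wild-type fragment
mut_toy = "ATGGTTGGTTAC"   # same fragment with a single C-to-T change
print(point_mutations(wt_toy, mut_toy))  # [(5, 'C', 'T')]
```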

Using the procedure described above, a resistance mutation converting a C to a T at
nucleotide 498 in the pWDC-4 sequence (SEQ ID No. 5) has been identified. The
plasmid carrying this mutation has been designated pMzC-1Val. This change converts
a GCT codon for alanine at amino acid 166 (SEQ ID No. 6) to a GTT codon for valine
and results in a protox enzyme that is at least 10X more resistant to protox-inhibiting
herbicide in the bacterial assay.



WO 95/34659 



PCT/IB95/00452 




pMzC-1Val, in the pBluescript SK vector, was deposited on September 30, 1994 under
the designation pWDC-8 with the Agricultural Research Culture Collection and given 
the deposit designation NRRL #21340. 

The same strategy was used to screen for herbicide-resistant forms of the
Arabidopsis Protox-1 gene in various vectors. One resistance mutation identified is a C
to T change at nucleotide 689 in the pWDC-2 sequence (SEQ ID No. 1); this plasmid is
designated pAraC-1Val. This change is identical to the pMzC-1Val mutant above,
converting a GCT codon for alanine at amino acid 220 (SEQ ID No. 2) to a GTT codon
for valine at the corresponding position in the Arabidopsis protox protein sequence.

A second resistant gene contains an A to G change at nucleotide 1307 in the 
pWDC-2 sequence (SEQ ID No. 1); this plasmid is designated pAraC-2Cys. This 
change converts a TAC codon for tyrosine at amino acid 426 (SEQ ID No. 2) to a TGC
codon for cysteine. The corresponding tyrosine codon in the maize protox-1 sequence 
at nucleotide position 1115-1117 (SEQ ID NO. 5; amino acid position 372 of SEQ ID 
NO. 6) may be similarly mutated to generate a herbicide resistant form of this enzyme. 

A third resistant mutant has a G to A change at nucleotide 691 in the pWDC-2 
sequence (SEQ ID No. 1); this plasmid is designated pAraC-3Ser. This mutation 
converts a GGT codon for glycine at amino acid 221 (SEQ ID No. 2) to an AGT codon for
serine at the codon position adjacent to the mutation in pAraC-1Val. The corresponding
glycine codon in the maize protox-1 sequence at nucleotide position 497-499 (SEQ ID 
NO. 5; amino acid position 167 of SEQ ID NO. 6) may be similarly mutated to generate 
a herbicide resistant form of this enzyme. 
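
The Arabidopsis coordinates quoted above can be cross-checked against the CDS location given for SEQ ID NO:1 (31..1644). The following sketch maps each reported nucleotide position to its residue number and position within the codon, then applies the base change. The small codon table holds only the six codons named in this example, and the helper names are illustrative.

```python
CDS_START = 31  # first nucleotide of the initiating ATG in SEQ ID NO:1

# Only the codons mentioned in this example; not a full genetic code.
CODON_TO_AA = {"GCT": "Ala", "GTT": "Val", "TAC": "Tyr",
               "TGC": "Cys", "GGT": "Gly", "AGT": "Ser"}


def locate(nt_pos: int, cds_start: int = CDS_START):
    """Return (residue number, 0-based offset within the codon) for a nucleotide."""
    offset = nt_pos - cds_start
    if offset < 0:
        raise ValueError("position lies upstream of the coding sequence")
    return offset // 3 + 1, offset % 3


def apply_change(codon: str, offset: int, new_base: str) -> str:
    """Replace the base at the given offset of a codon."""
    return codon[:offset] + new_base + codon[offset + 1:]


# (plasmid, nucleotide position, wild-type codon, new base) as reported in the text
reported = [("pAraC-1Val", 689, "GCT", "T"),
            ("pAraC-2Cys", 1307, "TAC", "G"),
            ("pAraC-3Ser", 691, "GGT", "A")]

for name, nt, codon, base in reported:
    residue, off = locate(nt)
    new_codon = apply_change(codon, off, base)
    print(name, residue, f"{codon}->{new_codon}",
          f"{CODON_TO_AA[codon]}->{CODON_TO_AA[new_codon]}")
```

With a CDS start at nucleotide 31, the three positions fall in codons 220, 426 and 221, which agrees with the residue numbers cited for SEQ ID No. 2.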

All the mutations described above result in a protox enzyme that is at least 10X
more resistant to protox-inhibiting herbicide in the bacterial assay.

pAraC-2Cys, in the pFL61 vector, was deposited on November 14, 1994 under the
designation pWDC-7 with the Agricultural Research Culture Collection and given the
deposit designation NRRL#21339N.

EXAMPLE 25: Additional herbicide-resistant codon substitutions at
positions identified in the random screen

The amino acids identified as herbicide resistance sites in the random screen are 
replaced by other amino acids and tested for function and for herbicide tolerance in the 
bacterial system. Oligonucleotide-directed mutagenesis of the Arabidopsis Protox-1
sequence is performed using the Transformer Site-Directed Mutagenesis Kit (Clontech, 
Palo Alto, CA). After amino acid changes are confirmed by sequence analysis, the 
mutated plasmids are transformed into SASX38 and plated on L-amp 100 media to test 
for function and on various concentrations of protox-inhibiting herbicide to test for 
tolerance. 

This procedure was applied to the alanine codon at nucleotides 688-690 and to the 
tyrosine codon at nucleotides 1306-1308 of the Arabidopsis protox sequence (SEQ ID 
No. 1). The results demonstrate that the alanine codon at nucleotides 688-690 can be 
changed to a codon for valine, threonine, leucine, or cysteine to yield a herbicide- 
resistant protox enzyme which retains function. The results further demonstrate that 
the tyrosine codon at nucleotides 1306-1308 can be changed to a codon for cysteine, 
isoleucine, leucine, valine or threonine to yield a herbicide-resistant protox enzyme 
which retains function. 
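
When designing mutagenic oligonucleotides for substitutions of this kind, one practical question is how many bases must change in the wild-type codon. The sketch below builds the standard genetic code and, for each replacement amino acid listed above, finds the codons with the fewest differences from GCT (alanine) or TAC (tyrosine). It is only an illustration: the codons actually used in the experiments are not stated in the text, and the function name is hypothetical.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (first base slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def closest_codons(wild_codon: str, target_aa: str):
    """Return (minimum base changes, codons for target_aa at that distance)."""
    def distance(codon: str) -> int:
        return sum(a != b for a, b in zip(codon, wild_codon))

    candidates = [c for c, aa in CODON_TABLE.items() if aa == target_aa]
    best = min(distance(c) for c in candidates)
    return best, sorted(c for c in candidates if distance(c) == best)


# One-letter codes for the replacements reported in the text.
substitutions = {"GCT": "VTLC",    # Ala -> Val, Thr, Leu, Cys
                 "TAC": "CILVT"}   # Tyr -> Cys, Ile, Leu, Val, Thr

for wild, targets in substitutions.items():
    for aa in targets:
        n, codons = closest_codons(wild, aa)
        print(f"{wild} ({CODON_TABLE[wild]}) -> {aa}: {codons}, {n} base change(s)")
```

In this enumeration only the Ala to Val, Ala to Thr and Tyr to Cys replacements are reachable by a single base change; the others need at least two, which is consistent with their being made by directed mutagenesis rather than recovered from the random screen.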

EXAMPLE 26: Demonstration of resistant mutations' cross-tolerance to
various protox-inhibiting compounds

Resistant mutant plasmids, selected for resistance against a single herbicide, are 
tested against a spectrum of other protox-inhibiting compounds. The SASX38 strain 
containing the wild-type plasmid is plated on a range of concentrations of each 
compound to determine the lethal concentration for each one. Resistant mutant 
plasmids in SASX38 are plated and scored for the ability to survive on a concentration 
of each compound which is at least 10 fold higher than the concentration that is lethal 
to the SASX38 strain containing the wild-type plasmid. 

Results from cross-tolerance testing show that each of the mutations identified
confers tolerance to a variety of protox-inhibiting compounds. In particular, the results
show that 1) the AraC-1Val mutation confers resistance to protox inhibitors including,
but not necessarily limited to, those having the Formulae IV, XI, XIII, XIV, XV and XVII;
2) the AraC-2Cys mutation confers resistance to protox inhibitors including, but not
necessarily limited to, those having the Formulae XI, XIII, XV and XVII; 3) the MzC-1Val
mutation confers resistance to protox inhibitors including, but not necessarily limited to,
those having the Formulae XI, XII, XIII, XIV, XV, XVI and XVII; 4) the AraC-3Ser
mutation confers resistance to protox inhibitors including, but not necessarily limited to,
bifenox and those having the Formulae IV, XII, XIII, XIV, XV, and XVII.
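
The cross-tolerance results listed above can be tabulated as sets, which makes it easy to see which inhibitors are covered by every mutation. The sketch below simply transcribes the Formula numbers from the preceding paragraph; the dictionary layout is an illustrative choice.

```python
# Inhibitors to which each mutation is reported to confer resistance.
tolerance = {
    "AraC-1Val": {"IV", "XI", "XIII", "XIV", "XV", "XVII"},
    "AraC-2Cys": {"XI", "XIII", "XV", "XVII"},
    "MzC-1Val":  {"XI", "XII", "XIII", "XIV", "XV", "XVI", "XVII"},
    "AraC-3Ser": {"bifenox", "IV", "XII", "XIII", "XIV", "XV", "XVII"},
}

# Inhibitors reported for all four mutations.
shared = set.intersection(*tolerance.values())
print(sorted(shared))  # ['XIII', 'XV', 'XVII']
```

Because the text describes each list as not necessarily exhaustive, the absence of a compound from a set does not establish that the corresponding mutant is sensitive to it.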

EXAMPLE 27: Production of herbicide tolerant plants by overexpression
of plant protox genes

The Arabidopsis Protox-1 coding sequences from both the wild-type and the
resistant mutant AraC-1Val genes are excised by partial EcoRI and XhoI digestion and
cloned into the pCGN1761ENX plant expression plasmid. The expression cassettes
containing 2X35S-Protox gene fusions are excised by digestion with XbaI and
cloned into the binary vector pCIB200. These binary protox plasmids are transformed
by electroporation into Agrobacterium and then into Arabidopsis using the vacuum
infiltration method (Bechtold et al., 1993). Transformants are selected on kanamycin,
and T2 seed is generated from a number of independent lines. This seed is plated on
GM media containing various concentrations of protox-inhibiting herbicide and scored
for germination and survival. Multiple transgenic lines overexpressing either the wild
type or the resistant mutant protox produce significant numbers of green seedlings on
a herbicide concentration that is lethal to the empty vector control.



Various modifications of the invention described herein will become apparent to 
those skilled in the art. Such modifications are intended to fall within the scope of the 
appended claims.

SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT:
(A) NAME: CIBA-GEIGY AG
(B) STREET: Klybeckstr. 141
(C) CITY: Basel
(E) COUNTRY: Switzerland
(F) POSTAL CODE (ZIP): 4002
(G) TELEPHONE: +41 61 69 11 11
(H) TELEFAX: +41 61 696 79 76
(I) TELEX: 962 991

(ii) TITLE OF INVENTION: MANIPULATION OF PROTOPORPHYRINOGEN OXIDASE
ENZYME ACTIVITY IN EUKARYOTIC ORGANISMS

(iii) NUMBER OF SEQUENCES: 20

(iv) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.25 (EPO)



(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1719 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 31..1644
(D) OTHER INFORMATION: /note= "Arabidopsis protox-1 cDNA;
sequence from pWDC-2"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

TGACAAAATT CCGAATTCTC TGCGATTTCC ATG GAG TTA TCT CTT CTC CGT CCG  54
                                 Met Glu Leu Ser Leu Leu Arg Pro
                                   1               5

ACG ACT CAA TCG CTT CTT CCG TCG TTT TCG AAG CCC AAT CTC CGA TTA  102
Thr Thr Gln Ser Leu Leu Pro Ser Phe Ser Lys Pro Asn Leu Arg Leu
     10                  15                  20

AAT GTT TAT AAG CCT CTT AGA CTC CGT TGT TCA GTG GCC GGT GGA CCA  150
Asn Val Tyr Lys Pro Leu Arg Leu Arg Cys Ser Val Ala Gly Gly Pro
 25                  30                  35                  40

ACC GTC GGA TCT TCA AAA ATC GAA GGC GGA GGA GGC ACC ACC ATC ACG  198
Thr Val Gly Ser Ser Lys Ile Glu Gly Gly Gly Gly Thr Thr Ile Thr
                 45                  50                  55


tt. rr^ fTim mom <~«m/— «nvn r-^mr~s /— «^-. » s->s-*rt-k « nw* « ^ ... ^ . 

n^Kj uni iui oio mi oil ovjV- bUrt vjox Ai 1 hbi bbi ATC GOT' 

Thr Asp Cys Val Ile Val Gly Gly Gly Ile Ser Gly Leu Cys Ile Ala
             60                  65                  70


246 


CAG GCG CTT GCT ACT AAG CAT CCT GAT GCT GCT CCG AAT TTA ATT GTG  294
Gln Ala Leu Ala Thr Lys His Pro Asp Ala Ala Pro Asn Leu Ile Val
         75                  80                  85

ACC GAG GCT AAG GAT CGT GTT GGA GGC AAC ATT ATC ACT CGT GAA GAG  342
Thr Glu Ala Lys Asp Arg Val Gly Gly Asn Ile Ile Thr Arg Glu Glu
     90                  95                 100

AAT GGT TTT CTC TGG GAA GAA GGT CCC AAT AGT TTT CAA CCG TCT GAT  390
Asn Gly Phe Leu Trp Glu Glu Gly Pro Asn Ser Phe Gln Pro Ser Asp
105                 110                 115                 120

CCT ATG CTC ACT ATG GTG GTA GAT AGT GGT TTG AAG GAT GAT TTG GTG  438
Pro Met Leu Thr Met Val Val Asp Ser Gly Leu Lys Asp Asp Leu Val
                125                 130                 135

TTG GGA GAT CCT ACT GCG CCA AGG TTT GTG TTG TGG AAT GGG AAA TTG  486
Leu Gly Asp Pro Thr Ala Pro Arg Phe Val Leu Trp Asn Gly Lys Leu
            140                 145                 150

AGG CCG GTT CCA TCG AAG CTA ACA GAC TTA CCG TTC TTT GAT TTG ATG  534
Arg Pro Val Pro Ser Lys Leu Thr Asp Leu Pro Phe Phe Asp Leu Met
        155                 160                 165

AGT ATT GGT GGG AAG ATT AGA GCT GGT TTT GGT GCA CTT GGC ATT CGA  582
Ser Ile Gly Gly Lys Ile Arg Ala Gly Phe Gly Ala Leu Gly Ile Arg
    170                 175                 180

CCG TCA CCT CCA GGT CGT GAA GAA TCT GTG GAG GAG TTT GTA CGG CGT  630
Pro Ser Pro Pro Gly Arg Glu Glu Ser Val Glu Glu Phe Val Arg Arg
185                 190                 195                 200

AAC CTC GGT GAT GAG GTT TTT GAG CGC CTG ATT GAA CCG TTT TGT TCA  678
Asn Leu Gly Asp Glu Val Phe Glu Arg Leu Ile Glu Pro Phe Cys Ser
                205                 210                 215

GGT GTT TAT GCT GGT GAT CCT TCA AAA CTG AGC ATG AAA GCA GCG TTT  726
Gly Val Tyr Ala Gly Asp Pro Ser Lys Leu Ser Met Lys Ala Ala Phe
            220                 225                 230

GGG AAG GTT TGG AAA CTA GAG CAA AAT GGT GGA AGC ATA ATA GGT GGT  774
Gly Lys Val Trp Lys Leu Glu Gln Asn Gly Gly Ser Ile Ile Gly Gly
        235                 240                 245

ACT TTT AAG GCA ATT CAG GAG AGG AAA AAC GCT CCC AAG GCA GAA CGA  822
Thr Phe Lys Ala Ile Gln Glu Arg Lys Asn Ala Pro Lys Ala Glu Arg
    250                 255                 260

GAC CCG CGC CTG CCA AAA CCA CAG CCC CAA ACA GTT GGT TCT TTC AGG  870
Asp Pro Arg Leu Pro Lys Pro Gln Gly Gln Thr Val Gly Ser Phe Arg
265                 270                 275                 280

AAG GGA CTT CGA ATG TTG CCA GAA GCA ATA TCT GCA AGA TTA GGT AGC  918
Lys Gly Leu Arg Met Leu Pro Glu Ala Ile Ser Ala Arg Leu Gly Ser
                285                 290                 295


AAA GTT AAG TTG TCT TGG AAG CTC TCA GGT ATC ACT AAG CTG GAG AGC  966
Lys Val Lys Leu Ser Trp Lys Leu Ser Gly Ile Thr Lys Leu Glu Ser
            300                 305                 310

GGA GGA TAC AAC TTA ACA TAT GAG ACT CCA GAT GGT TTA GTT TCC GTG  1014
Gly Gly Tyr Asn Leu Thr Tyr Glu Thr Pro Asp Gly Leu Val Ser Val
        315                 320                 325

CAG AGC AAA AGT GTT GTA ATG ACG GTG CCA TCT CAT GTT GCA AGT GGT  1062
Gln Ser Lys Ser Val Val Met Thr Val Pro Ser His Val Ala Ser Gly
    330                 335                 340

CTC TTG CGC CCT CTT TCT GAA TCT GCT GCA AAT GCA CTC TCA AAA CTA  1110
Leu Leu Arg Pro Leu Ser Glu Ser Ala Ala Asn Ala Leu Ser Lys Leu
345                 350                 355                 360

TAT TAC CCA CCA GTT GCA GCA GTA TCT ATC TCG TAC CCG AAA GAA GCA  1158
Tyr Tyr Pro Pro Val Ala Ala Val Ser Ile Ser Tyr Pro Lys Glu Ala
                365                 370                 375

ATC CGA ACA GAA TGT TTG ATA GAT GGT GAA CTA AAG GGT TTT GGG CAA  1206
Ile Arg Thr Glu Cys Leu Ile Asp Gly Glu Leu Lys Gly Phe Gly Gln
            380                 385                 390

TTG CAT CCA CGC ACG CAA GGA GTT GAA ACA TTA GGA ACT ATC TAC AGC  1254
Leu His Pro Arg Thr Gln Gly Val Glu Thr Leu Gly Thr Ile Tyr Ser
        395                 400                 405

TCC TCA CTC TTT CCA AAT CGC GCA CCG CCC GGA AGA ATT TTG CTG TTG  1302
Ser Ser Leu Phe Pro Asn Arg Ala Pro Pro Gly Arg Ile Leu Leu Leu
    410                 415                 420

AAC TAC ATT GGC GGG TCT ACA AAC ACC GGA ATT CTG TCC AAG TCT GAA  1350
Asn Tyr Ile Gly Gly Ser Thr Asn Thr Gly Ile Leu Ser Lys Ser Glu
425                 430                 435                 440

GGT GAG TTA GTG GAA GCA GTT GAC AGA GAT TTG AGG AAA ATG CTA ATT  1398
Gly Glu Leu Val Glu Ala Val Asp Arg Asp Leu Arg Lys Met Leu Ile
                445                 450                 455

AAG CCT AAT TCG ACC GAT CCA CTT AAA TTA GGA GTT AGG GTA TGG CCT  1446
Lys Pro Asn Ser Thr Asp Pro Leu Lys Leu Gly Val Arg Val Trp Pro
            460                 465                 470

CAA GCC ATT CCT CAG TTT CTA GTT GGT CAC TTT GAT ATC CTT GAC ACG  1494
Gln Ala Ile Pro Gln Phe Leu Val Gly His Phe Asp Ile Leu Asp Thr
        475                 480                 485

GCT AAA TCA TCT CTA ACG TCT TCG GGC TAC GAA GGG CTA TTT TTG GGT  1542
Ala Lys Ser Ser Leu Thr Ser Ser Gly Tyr Glu Gly Leu Phe Leu Gly
    490                 495                 500

GGC AAT TAC GTC GCT GGT GTA GCC TTA GGC CGG TGT GTA GAA GGC GCA  1590
Gly Asn Tyr Val Ala Gly Val Ala Leu Gly Arg Cys Val Glu Gly Ala
505                 510                 515                 520

TAT GAA ACC GCG ATT GAG GTC AAC AAC TTC ATG TCA CGG TAC GCT TAC  1638
Tyr Glu Thr Ala Ile Glu Val Asn Asn Phe Met Ser Arg Tyr Ala Tyr
                525                 530                 535

AAG TAAATGTAAA ACATTAAATC TCCCAGCTTG CGTGAGTTTT ATTAAATATT  1691
Lys

TTGAGATATC CAAAAAAAAA AAAAAAAA  1719


(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 537 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:




Met Glu Leu Ser Leu Leu Arg Pro Thr Thr Gln Ser Leu Leu Pro Ser
  1               5                  10                  15

Phe Ser Lys Pro Asn Leu Arg Leu Asn Val Tyr Lys Pro Leu Arg Leu
             20                  25                  30

Arg Cys Ser Val Ala Gly Gly Pro Thr Val Gly Ser Ser Lys Ile Glu
         35                  40                  45

Gly Gly Gly Gly Thr Thr Ile Thr Thr Asp Cys Val Ile Val Gly Gly
     50                  55                  60

Gly Ile Ser Gly Leu Cys Ile Ala Gln Ala Leu Ala Thr Lys His Pro
 65                  70                  75                  80

Asp Ala Ala Pro Asn Leu Ile Val Thr Glu Ala Lys Asp Arg Val Gly
                 85                  90                  95

Gly Asn Ile Ile Thr Arg Glu Glu Asn Gly Phe Leu Trp Glu Glu Gly
            100                 105                 110

Pro Asn Ser Phe Gln Pro Ser Asp Pro Met Leu Thr Met Val Val Asp
        115                 120                 125

Ser Gly Leu Lys Asp Asp Leu Val Leu Gly Asp Pro Thr Ala Pro Arg
    130                 135                 140

Phe Val Leu Trp Asn Gly Lys Leu Arg Pro Val Pro Ser Lys Leu Thr
145                 150                 155                 160

Asp Leu Pro Phe Phe Asp Leu Met Ser Ile Gly Gly Lys Ile Arg Ala
                165                 170                 175

Gly Phe Gly Ala Leu Gly Ile Arg Pro Ser Pro Pro Gly Arg Glu Glu
            180                 185                 190



Ser Val Glu Glu Phe Val Arg Arg Asn Leu Gly Asp Glu Val Phe Glu
        195                 200                 205

Arg Leu Ile Glu Pro Phe Cys Ser Gly Val Tyr Ala Gly Asp Pro Ser
    210                 215                 220

Lys Leu Ser Met Lys Ala Ala Phe Gly Lys Val Trp Lys Leu Glu Gln
225                 230                 235                 240

Asn Gly Gly Ser Ile Ile Gly Gly Thr Phe Lys Ala Ile Gln Glu Arg
                245                 250                 255

Lys Asn Ala Pro Lys Ala Glu Arg Asp Pro Arg Leu Pro Lys Pro Gln
            260                 265                 270

Gly Gln Thr Val Gly Ser Phe Arg Lys Gly Leu Arg Met Leu Pro Glu
        275                 280                 285

Ala Ile Ser Ala Arg Leu Gly Ser Lys Val Lys Leu Ser Trp Lys Leu
    290                 295                 300

Ser Gly Ile Thr Lys Leu Glu Ser Gly Gly Tyr Asn Leu Thr Tyr Glu
305                 310                 315                 320

Thr Pro Asp Gly Leu Val Ser Val Gln Ser Lys Ser Val Val Met Thr
                325                 330                 335

Val Pro Ser His Val Ala Ser Gly Leu Leu Arg Pro Leu Ser Glu Ser
            340                 345                 350

Ala Ala Asn Ala Leu Ser Lys Leu Tyr Tyr Pro Pro Val Ala Ala Val
        355                 360                 365

Ser Ile Ser Tyr Pro Lys Glu Ala Ile Arg Thr Glu Cys Leu Ile Asp
    370                 375                 380

Gly Glu Leu Lys Gly Phe Gly Gln Leu His Pro Arg Thr Gln Gly Val
385                 390                 395                 400

Glu Thr Leu Gly Thr Ile Tyr Ser Ser Ser Leu Phe Pro Asn Arg Ala
                405                 410                 415

Pro Pro Gly Arg Ile Leu Leu Leu Asn Tyr Ile Gly Gly Ser Thr Asn
            420                 425                 430

Thr Gly Ile Leu Ser Lys Ser Glu Gly Glu Leu Val Glu Ala Val Asp
        435                 440                 445

Arg Asp Leu Arg Lys Met Leu Ile Lys Pro Asn Ser Thr Asp Pro Leu
    450                 455                 460

Lys Leu Gly Val Arg Val Trp Pro Gln Ala Ile Pro Gln Phe Leu Val
465                 470                 475                 480

Gly His Phe Asp Ile Leu Asp Thr Ala Lys Ser Ser Leu Thr Ser Ser
                485                 490                 495

Gly Tyr Glu Gly Leu Phe Leu Gly Gly Asn Tyr Val Ala Gly Val Ala
            500                 505                 510

Leu Gly Arg Cys Val Glu Gly Ala Tyr Glu Thr Ala Ile Glu Val Asn
        515                 520                 525

Asn Phe Met Ser Arg Tyr Ala Tyr Lys
    530                 535



(2) INFORMATION FOR SEQ ID NO:3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1738 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 70..1596
(D) OTHER INFORMATION: /note= "Arabidopsis protox-2 cDNA;
sequence from pWDC-1"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:

TTTTTTACTT ATTTCCGTCA CTGCTTTCGA CTGGTCAGAG ATTTTGACTC TGAATTGTTG  60

CAGATAGCA ATG GCG TCT GGA GCA GTA GCA GAT CAT CAA ATT GAA GCG  108
          Met Ala Ser Gly Ala Val Ala Asp His Gln Ile Glu Ala
            1               5                  10

GTT TCA GGA AAA AGA GTC GCA GTC GTA GGT GCA GGT GTA AGT GGA CTT  156
Val Ser Gly Lys Arg Val Ala Val Val Gly Ala Gly Val Ser Gly Leu
     15                  20                  25

GCG GCG GCT TAC AAG TTG AAA TCG AGG GGT TTG AAT GTG ACT GTG TTT  204
Ala Ala Ala Tyr Lys Leu Lys Ser Arg Gly Leu Asn Val Thr Val Phe
 30                  35                  40                  45

GAA GCT GAT GGA AGA GTA GGT GGG AAG TTG AGA AGT GTT ATG CAA AAT  252
Glu Ala Asp Gly Arg Val Gly Gly Lys Leu Arg Ser Val Met Gln Asn
                 50                  55                  60

GGT TTG ATT TGG GAT GAA GGA GCA AAC ACC ATG ACT GAG GCT GAG CCA  300
Gly Leu Ile Trp Asp Glu Gly Ala Asn Thr Met Thr Glu Ala Glu Pro
             65                  70                  75

GAA GTT GGG AGT TTA CTT GAT GAT CTT GGG CTT CGT GAG AAA CAA CAA  348
Glu Val Gly Ser Leu Leu Asp Asp Leu Gly Leu Arg Glu Lys Gln Gln
         80                  85                  90

TTT CCA ATT TCA CAG AAA AAG CGG TAT ATT GTG CGG AAT GGT GTA CCT 396 
Phe Pro He Ser Gin Lys Lys Arg Tyr He Val Arg Asn Gly Val Pro 
95 100 105 
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rrr ATG cta cct acc aat ccc ATA gag cig gtc aca agt agt gtg ctc 

Si Met 2u pS Thr Asn Pro He Glu Leu Val Thr Ser Ser Val 



110 us 120 

TCT ACC CAA TCT AAG TTT CAA ATC TTG TTG GAA CCA TTT TTA TGG AAG 
S S Lys Phe Gin lie Leu Leu Glu Pre Phe Leu Trp Lys 
130 135 

AAA AAG TCC TCA AAA GTC TCA GAT GCA TCT GCT GAA GAA AGT GTA AGC 
Lys Lys Ser Ser Lys Val Ser Asp Ala Ser Ala Glu Glu Ser Val Ser 
145 150 13 

GAG TTC TTT CAA CGC CAT TTT GGA CAA GAG GTT GTT GAC TAT CTC ATC 
Si JS Gin Arg His Phe Gly Gin Glu Val Val Asp Tyr Leu lie 
160 I 65 1 

GAC CCT TTT GTT GGT GGA ACA AGT GCT GCG GAC CCT GAT TCC CTT TCA 
S pS Pne val Gly Gly Thr Ser Ala Ala Asp Pro Asp Ser Leu Ser 
175 180 185 

ATG AAG CAT TCT TTC CCA GAT CTC TGG AAT GTA GAG AAA AGT TTT GGC 
£2 J£ Ss Ser Phe Pro Asp Leu Trp Asn Val Glu Lys Ser Phe Gly 
190 195 200 

TCT ATT ATA GTC GGT GCA ATC AGA ACA AAG TTT GCT GCT AAA GGT GGT 
Sr lie lie Val Gly Ala He Arg Thr Lys Phe Ala Ala Lys Gly Gly 
210 2X5 ' 

AAA AGT AGA GAC ACA AAG AGT TCT CCT GGC ACA AAA AAG GGT TCG CGT 
Lys Ser Arg Asp Thr Lys Ser Ser Pro Gly Thr Lys Lys Gly Ser Arg 
225 230 

GGG TCA TTC TCT TTT AAG GGG GGA ATG CAG ATT CTT CCT GAT ACG TTG 
S S S 2r Phe Lys Gly Gly Met Gin lie Leu Pro Asp Thr Leu 
240 245 250 

TGC AAA AGT CTC TCA CAT GAT GAG ATC AAT TTA GAC TCC AAG GTA CTC 
Cys Lys Ser Leu Ser His Asp Glu lie Asn Leu Asp Ser Lys Val Leu 
255 260 265 

TCT TTG TCT TAG AAT TCT GGA TCA AGA CAG GAG AAC TGG TCA TTA TCT 
Sr S Ser Tyr Asn Ser Gly Ser Arg Gin Glu Asn Trp Ser Leu Ser 



444 



492 



540 



588 



636 



684 



732 



780 



828 



876 



924 



TGT GTT TCG CAT AAT GAA ACG CAG AGA CAA AAC CCC CAT TAT GAT GCT 
Z S2 Ser Ss Asn Glu Thr Gin Arg Gin Asn Pro His Tyr Asp Ala 



290 



GTA ATT ATG ACG GCT CCT CTG TGC AAT GTG AAG GAG ATG AAG GTT ATG 
S£ ne Met Thr Ala Pro Leu Cys Asn Val Lys Glu Met Lys Val Met 
305 310 J 



972 



1020 
AAA GGA GGA CAA CCC TTT CAG CTA AAC TTT CTC CCC GAG ATT AAT TAC 1068
Lys Gly Gly Gln Pro Phe Gln Leu Asn Phe Leu Pro Glu Ile Asn Tyr
320 325 330

ATG CCC CTC TCG GTT TTA ATC ACC ACA TTC ACA AAG GAG AAA GTA AAG 1116
Met Pro Leu Ser Val Leu Ile Thr Thr Phe Thr Lys Glu Lys Val Lys
335 340 345

AGA CCT CTT GAA GGC TTT GGG GTA CTC ATT CCA TCT AAG GAG CAA AAG 1164
Arg Pro Leu Glu Gly Phe Gly Val Leu Ile Pro Ser Lys Glu Gln Lys
350 355 360 365

CAT GGT TTC kCI CIA GGT KC& CTT TTT TCA TCA ATG ATG TTT CCA 1212
His Gly Phe Lys Thr Leu Gly Thr Leu Phe Ser Ser Met Met Phe Pro
370 375 380

GAT CGT TCC CCT ACT GAC GTT CAT CTA TAT ACA ACT TTT ATT GGT GGG 1260
Asp Arg Ser Pro Ser Asp Val His Leu Tyr Thr Thr Phe Ile Gly Gly
385 390 395

f 1, f 36 ^ C CAG GAA CTA GCC AAA GCT TCC ACT GAC GAA TTA AAA CAA 1308
Ser Arg Asn Gln Glu Leu Ala Lys Ala Ser Thr Asp Glu Leu Lys Gln
400 405 410

ST f ? I 01 ^ 017 016 ^ SAA GGT GAA CCC 1356
Val Val Thr Ser Asp Leu Gln Arg Leu Leu Gly Val Glu Gly Glu Pro
415 420 425

S? f C ^ T TAC TAT TGG AGG AAA GCA TTC CCG TTG TAT GAC 1404
Val Ser Val Asn His Tyr Tyr Trp Arg Lys Ala Phe Pro Leu Tyr Asp
430 435 440 445

AGC AGC TAT GAC TCA GTC ATG GAA GCA ATT GAC AAG ATG GAG AAT GAT 1452
Ser Ser Tyr Asp Ser Val Met Glu Ala Ile Asp Lys Met Glu Asn Asp
450 455 460

CTA CCT GGG TTC TTC TAT GCA GGT AAT CAT CGA GGG GGG CTC TCT GTT 1500
Leu Pro Gly Phe Phe Tyr Ala Gly Asn His Arg Gly Gly Leu Ser Val
465 470 475

GGG AAA TCA ATA GCA TCA GGT TGC AAA GCA GCT GAC CTT GTG ATC TCA 1548
Gly Lys Ser Ile Ala Ser Gly Cys Lys Ala Ala Asp Leu Val Ile Ser
480 485 490

TAC CTG GAG TCT TGC TCA AAT GAC AAG AAA CCA AAT GAC AGC TTA TAACATTGTC 1603
Tyr Leu Glu Ser Cys Ser Asn Asp Lys Lys Pro Asn Asp Ser Leu
495 500 505

AAGGTTCGTC CCTTTTTATC ACTTACTTTG TAAACTTGTA AAATGCAACA AGCCGCCGTG 1663
CGATTAGCCA ACAACTCAGC AAAACCCAGA TTCTCATAAG GCTCACTAAT TCCAGAATAA 1723
ACTATTTATG TAAAA 1738



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 508 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Ala Ser Gly Ala Val Ala Asp His Gln Ile Glu Ala Val Ser Gly
1 5 10 15

Lys Arg Val Ala Val Val Gly Ala Gly Val Ser Gly Leu Ala Ala Ala
20 25 30

Tyr Lys Leu Lys Ser Arg Gly Leu Asn Val Thr Val Phe Glu Ala Asp
35 40 45

Gly Arg Val Gly Gly Lys Leu Arg Ser Val Met Gln Asn Gly Leu Ile
50 55 60

Trp Asp Glu Gly Ala Asn Thr Met Thr Glu Ala Glu Pro Glu Val Gly
65 70 75 80

Ser Leu Leu Asp Asp Leu Gly Leu Arg Glu Lys Gln Gln Phe Pro Ile
85 90 95

Ser Gln Lys Lys Arg Tyr Ile Val Arg Asn Gly Val Pro Val Met Leu
100 105 110

Pro Thr Asn Pro Ile Glu Leu Val Thr Ser Ser Val Leu Ser Thr Gln
115 120 125

Ser Lys Phe Gln Ile Leu Leu Glu Pro Phe Leu Trp Lys Lys Lys Ser
130 135 140

Ser Lys Val Ser Asp Ala Ser Ala Glu Glu Ser Val Ser Glu Phe Phe
145 150 155 160

Gln Arg His Phe Gly Gln Glu Val Val Asp Tyr Leu Ile Asp Pro Phe
165 170 175

Val Gly Gly Thr Ser Ala Ala Asp Pro Asp Ser Leu Ser Met Lys His
180 185 190

Ser Phe Pro Asp Leu Trp Asn Val Glu Lys Ser Phe Gly Ser Ile Ile
195 200 205

Val Gly Ala Ile Arg Thr Lys Phe Ala Ala Lys Gly Gly Lys Ser Arg
210 215 220

Asp Thr Lys Ser Ser Pro Gly Thr Lys Lys Gly Ser Arg Gly Ser Phe
225 230 235 240

Ser Phe Lys Gly Gly Met Gln Ile Leu Pro Asp Thr Leu Cys Lys Ser
245 250 255

Leu Ser His Asp Glu Ile Asn Leu Asp Ser Lys Val Leu Ser Leu Ser
260 265 270

Tyr Asn Ser Gly Ser Arg Gln Glu Asn Trp Ser Leu Ser Cys Val Ser
275 280 285

His Asn Glu Thr Gln Arg Gln Asn Pro His Tyr Asp Ala Val Ile Met
290 295 300

Thr Ala Pro Leu Cys Asn Val Lys Glu Met Lys Val Met Lys Gly Gly
305 310 315 320

Gln Pro Phe Gln Leu Asn Phe Leu Pro Glu Ile Asn Tyr Met Pro Leu
325 330 335

Ser Val Leu Ile Thr Thr Phe Thr Lys Glu Lys Val Lys Arg Pro Leu
340 345 350

Glu Gly Phe Gly Val Leu Ile Pro Ser Lys Glu Gln Lys His Gly Phe
355 360 365

Lys Thr Leu Gly Thr Leu Phe Ser Ser Met Met Phe Pro Asp Arg Ser
370 375 380

Pro Ser Asp Val His Leu Tyr Thr Thr Phe Ile Gly Gly Ser Arg Asn
385 390 395 400

Gln Glu Leu Ala Lys Ala Ser Thr Asp Glu Leu Lys Gln Val Val Thr
405 410 415

Ser Asp Leu Gln Arg Leu Leu Gly Val Glu Gly Glu Pro Val Ser Val
420 425 430

Asn His Tyr Tyr Trp Arg Lys Ala Phe Pro Leu Tyr Asp Ser Ser Tyr
435 440 445

Asp Ser Val Met Glu Ala Ile Asp Lys Met Glu Asn Asp Leu Pro Gly
450 455 460

Phe Phe Tyr Ala Gly Asn His Arg Gly Gly Leu Ser Val Gly Lys Ser
465 470 475 480

Ile Ala Ser Gly Cys Lys Ala Ala Asp Leu Val Ile Ser Tyr Leu Glu
485 490 495

Ser Cys Ser Asn Asp Lys Lys Pro Asn Asp Ser Leu
500 505




(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1698 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 2..1453

(D) OTHER INFORMATION: /note= "Maize protox-1 cDNA (not
full-length); sequence from pWDC-4"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:




G AAT TCG GCG GAC TGC GTC GTG GTG GGC GGA GGC ATC AGT GGC CTC 46
Asn Ser Ala Asp Cys Val Val Val Gly Gly Gly Ile Ser Gly Leu
1 5 10 15

TGC ACC GCG CAG GCG CTG GCC ACG CGG CAC GGC GTC GGG GAC GTG CTT 94
Cys Thr Ala Gln Ala Leu Ala Thr Arg His Gly Val Gly Asp Val Leu
20 25 30

GTC ACG GAG GCC CGC GCC CGC CCC GGC GGC AAC ATT ACC ACC GTC GAG 142
Val Thr Glu Ala Arg Ala Arg Pro Gly Gly Asn Ile Thr Thr Val Glu
35 40 45

CGC CCC GAG GAA GGG TAC CTC TGG GAG GAG GGT CCC AAC AGC TTC CAG 190
Arg Pro Glu Glu Gly Tyr Leu Trp Glu Glu Gly Pro Asn Ser Phe Gln
50 55 60

CCC TCC GAC CCC GTT CTC ACC ATG GCC GTG GAC AGC GGA CTG AAG GAT 238
Pro Ser Asp Pro Val Leu Thr Met Ala Val Asp Ser Gly Leu Lys Asp
65 70 75

GAC TTG GTT TTT GGG GAC CCA AAC GCG CCG CGT TTC GTG CTG TGG GAG 286
Asp Leu Val Phe Gly Asp Pro Asn Ala Pro Arg Phe Val Leu Trp Glu
80 85 90 95

GGG AAG CTG AGG CCC GTG CCA TCC AAG CCC GCC GAC CTC CCG TTC TTC 334
Gly Lys Leu Arg Pro Val Pro Ser Lys Pro Ala Asp Leu Pro Phe Phe
100 105 110

GAT CTC ATG AGC ATC CCA GGG AAG CTC AGG GCC GGT CTA GGC GCG CTT 382
Asp Leu Met Ser Ile Pro Gly Lys Leu Arg Ala Gly Leu Gly Ala Leu
115 120 125

GGC ATC CGC CCG CCT CCT CCA GGC CGC GAA GAG TCA GTG GAG GAG TTC 430
Gly Ile Arg Pro Pro Pro Pro Gly Arg Glu Glu Ser Val Glu Glu Phe
130 135 140

GTG CGC CGC AAC CTC GGT GCT GAG GTC TTT GAG CGC CTC ATT GAG CCT 478
Val Arg Arg Asn Leu Gly Ala Glu Val Phe Glu Arg Leu Ile Glu Pro
145 150 155

TTC TGC TCA GGT GTC TAT GCT GGT GAT CCT TCT AAG CTC AGC ATG AAG 526
Phe Cys Ser Gly Val Tyr Ala Gly Asp Pro Ser Lys Leu Ser Met Lys
160 165 170 175

GCT GCA TTT GGG AAG GTT TGG CGG TTG GAA GAA ACT GGA GGT AGT ATT 574
Ala Ala Phe Gly Lys Val Trp Arg Leu Glu Glu Thr Gly Gly Ser Ile
180 185 190

ATT GGT GGA ACC ATC AAG ACA ATT CAG GAG AGG AGC AAG AAT CCA AAA 622
Ile Gly Gly Thr Ile Lys Thr Ile Gln Glu Arg Ser Lys Asn Pro Lys
195 200 205

CCA CCG AGG GAT GCC CGC CTT CCG AAG CCA AAA GGG CAG ACA GTT GCA 670
Pro Pro Arg Asp Ala Arg Leu Pro Lys Pro Lys Gly Gln Thr Val Ala
210 215 220

TCT TTC AGG AAG GGT CTT GCC ATG CTT CCA AAT GCC ATT ACA TCC AGC 718
Ser Phe Arg Lys Gly Leu Ala Met Leu Pro Asn Ala Ile Thr Ser Ser
225 230 235

TTG GGT AGT AAA GTC AAA CTA TCA TGG AAA CTC ACG AGC ATT ACA AAA 766
Leu Gly Ser Lys Val Lys Leu Ser Trp Lys Leu Thr Ser Ile Thr Lys
240 245 250 255

TCA GAT GAC AAG GGA TAT GTT TTG GAG TAT GAA ACG CCA GAA GGG GTT 814
Ser Asp Asp Lys Gly Tyr Val Leu Glu Tyr Glu Thr Pro Glu Gly Val
260 265 270

GTT TCG GTG CAG GCT AAA AGT GTT ATC ATG ACT ATT CCA TCA TAT GTT 862
Val Ser Val Gln Ala Lys Ser Val Ile Met Thr Ile Pro Ser Tyr Val
275 280 285
GCT AGC AAC ATT TTG CGT CCA CTT TCA AGC GAT GCT GCA GAT GCT CTA 910
Ala Ser Asn Ile Leu Arg Pro Leu Ser Ser Asp Ala Ala Asp Ala Leu
290 295 300

TCA AGA TTC TAT TAT CCA CCG GTT GCT GCT GTA ACT GTT TCG TAT CCA 958
Ser Arg Phe Tyr Tyr Pro Pro Val Ala Ala Val Thr Val Ser Tyr Pro
305 310 315

AAG GAA GCA ATT AGA AAA GAA TGC TTA ATT GAT GGG GAA CTC CAG GGC 1006
Lys Glu Ala Ile Arg Lys Glu Cys Leu Ile Asp Gly Glu Leu Gln Gly
320 325 330 335

TTT GGC CAG TTG CAT CCA CGT AGT CAA GGA GTT GAG ACA TTA GGA ACA 1054
Phe Gly Gln Leu His Pro Arg Ser Gln Gly Val Glu Thr Leu Gly Thr
340 345 350

ATA TAC AGT TCC TCA CTC TTT CCA AAT CGT GCT CCT GAC GGT AGG GTG 1102
Ile Tyr Ser Ser Ser Leu Phe Pro Asn Arg Ala Pro Asp Gly Arg Val
355 360 365

TTA CTT CTA AAC TAC ATA GGA GGT GCT ACA AAC ACA GGA ATT GTT TCC 1150
Leu Leu Leu Asn Tyr Ile Gly Gly Ala Thr Asn Thr Gly Ile Val Ser
370 375 380

AAG ACT GAA AGT GAG CTG GTC GAA GCA GTT GAC CGT GAC CTC CGA AAA 1198
Lys Thr Glu Ser Glu Leu Val Glu Ala Val Asp Arg Asp Leu Arg Lys
385 390 395

ATG CTT ATA AAT TCT ACA GCA GTG GAC CCT TTA GTC CTT GGT GTT CGA 1246
Met Leu Ile Asn Ser Thr Ala Val Asp Pro Leu Val Leu Gly Val Arg
400 405 410 415

GTT TGG CCA CAA GCC ATA CCT CAG TTC CTG GTA GGA CAT CTT GAT CTT 1294
Val Trp Pro Gln Ala Ile Pro Gln Phe Leu Val Gly His Leu Asp Leu
420 425 430

CTG GAA GCC GCA AAA GCT GCC CTG GAC CGA GGT GGC TAC GAT GGG CTG 1342
Leu Glu Ala Ala Lys Ala Ala Leu Asp Arg Gly Gly Tyr Asp Gly Leu
435 440 445

TTC CTA GGA GGG AAC TAT GTT GCA GGA GTT GCC CTG GGC AGA TGC GTT 1390
Phe Leu Gly Gly Asn Tyr Val Ala Gly Val Ala Leu Gly Arg Cys Val
450 455 460

GAG GGC GCG TAT GAA AGT GCC TCG CAA ATA TCT GAC TTC TTG ACC AAG 1438
Glu Gly Ala Tyr Glu Ser Ala Ser Gln Ile Ser Asp Phe Leu Thr Lys
465 470 475

TAT GCC TAC AAG TGATGAAAGA AGTGGAGCGC TACTTGTTAA TCGTTTATGT 1490
Tyr Ala Tyr Lys
480
1550

TGTAAATTGC ATTTCTGTTC TTTTTTCTAT CAGTAATTAG TTATATTTTA GTTCTGTAGG 1610
AGATTGTTCT GTTCACTGCC CTTCAAAAGA AATTTTATTT TTCATTCTTT TATGAGAGCT 1670
GTGCTACTTA AAAAAAAAAA AAAAAAAA 1698


(2) INFORMATION FOR SEQ ID NO: 6: 




(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 483 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 




(ii) MOLECULE TYPE: protein 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 




Asn Ser Ala Asp Cys Val Val Val Gly Gly Gly Ile Ser Gly Leu Cys
1 5 10 15

Thr Ala Gln Ala Leu Ala Thr Arg His Gly Val Gly Asp Val Leu Val
20 25 30

Thr Glu Ala Arg Ala Arg Pro Gly Gly Asn Ile Thr Thr Val Glu Arg
35 40 45

Pro Glu Glu Gly Tyr Leu Trp Glu Glu Gly Pro Asn Ser Phe Gln Pro
50 55 60

Ser Asp Pro Val Leu Thr Met Ala Val Asp Ser Gly Leu Lys Asp Asp
65 70 75 80

Leu Val Phe Gly Asp Pro Asn Ala Pro Arg Phe Val Leu Trp Glu Gly
85 90 95

Lys Leu Arg Pro Val Pro Ser Lys Pro Ala Asp Leu Pro Phe Phe Asp
100 105 110

Leu Met Ser Ile Pro Gly Lys Leu Arg Ala Gly Leu Gly Ala Leu Gly
115 120 125

Ile Arg Pro Pro Pro Pro Gly Arg Glu Glu Ser Val Glu Glu Phe Val
130 135 140

Arg Arg Asn Leu Gly Ala Glu Val Phe Glu Arg Leu Ile Glu Pro Phe
145 150 155 160

Cys Ser Gly Val Tyr Ala Gly Asp Pro Ser Lys Leu Ser Met Lys Ala
165 170 175

Ala Phe Gly Lys Val Trp Arg Leu Glu Glu Thr Gly Gly Ser Ile Ile
180 185 190

Gly Gly Thr Ile Lys Thr Ile Gln Glu Arg Ser Lys Asn Pro Lys Pro
195 200 205

Pro Arg Asp Ala Arg Leu Pro Lys Pro Lys Gly Gln Thr Val Ala Ser
210 215 220

Phe Arg Lys Gly Leu Ala Met Leu Pro Asn Ala Ile Thr Ser Ser Leu
225 230 235 240

Gly Ser Lys Val Lys Leu Ser Trp Lys Leu Thr Ser Ile Thr Lys Ser
245 250 255

Asp Asp Lys Gly Tyr Val Leu Glu Tyr Glu Thr Pro Glu Gly Val Val
260 265 270

Ser Val Gln Ala Lys Ser Val Ile Met Thr Ile Pro Ser Tyr Val Ala
275 280 285

Ser Asn Ile Leu Arg Pro Leu Ser Ser Asp Ala Ala Asp Ala Leu Ser
290 295 300

Arg Phe Tyr Tyr Pro Pro Val Ala Ala Val Thr Val Ser Tyr Pro Lys
305 310 315 320

Glu Ala Ile Arg Lys Glu Cys Leu Ile Asp Gly Glu Leu Gln Gly Phe
325 330 335

Gly Gln Leu His Pro Arg Ser Gln Gly Val Glu Thr Leu Gly Thr Ile
340 345 350

Tyr Ser Ser Ser Leu Phe Pro Asn Arg Ala Pro Asp Gly Arg Val Leu
355 360 365

Leu Leu Asn Tyr Ile Gly Gly Ala Thr Asn Thr Gly Ile Val Ser Lys
370 375 380

Thr Glu Ser Glu Leu Val Glu Ala Val Asp Arg Asp Leu Arg Lys Met
385 390 395 400

Leu Ile Asn Ser Thr Ala Val Asp Pro Leu Val Leu Gly Val Arg Val
405 410 415

Trp Pro Gln Ala Ile Pro Gln Phe Leu Val Gly His Leu Asp Leu Leu
420 425 430

Glu Ala Ala Lys Ala Ala Leu Asp Arg Gly Gly Tyr Asp Gly Leu Phe
435 440 445

Leu Gly Gly Asn Tyr Val Ala Gly Val Ala Leu Gly Arg Cys Val Glu
450 455 460

Gly Ala Tyr Glu Ser Ala Ser Gln Ile Ser Asp Phe Leu Thr Lys Tyr
465 470 475 480

Ala Tyr Lys
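
The CDS ranges and protein lengths stated in this listing can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the original listing; the function and dictionary names are hypothetical, and it assumes that each CDS LOCATION range includes the terminal stop codon (which appears to be the case for SEQ ID NO: 5, where TGA follows the final Lys codon).

```python
def expected_protein_length(cds_start: int, cds_end: int,
                            includes_stop: bool = True) -> int:
    """Return the residue count implied by a 1-based, inclusive CDS range."""
    span = cds_end - cds_start + 1
    if span % 3 != 0:
        raise ValueError(f"CDS span of {span} nt is not a whole number of codons")
    codons = span // 3
    # Assumption: the last codon in the range is a stop codon, not a residue.
    return codons - 1 if includes_stop else codons


# (CDS start, CDS end, stated protein length) as given in the listing headers.
LISTING = {
    "SEQ ID NO: 5 -> 6": (2, 1453, 483),
    "SEQ ID NO: 7 -> 8": (64, 1698, 544),
    "SEQ ID NO: 9 -> 10": (29, 1501, 490),
}


def check_listing(entries=LISTING):
    """Map each entry to True when the CDS range matches the stated length."""
    return {
        name: expected_protein_length(start, end) == length
        for name, (start, end, length) in entries.items()
    }


if __name__ == "__main__":
    for name, ok in check_listing().items():
        print(name, "consistent" if ok else "MISMATCH")
```

For example, LOCATION 2..1453 spans 1452 nucleotides, or 484 codons, which corresponds to 483 residues plus one stop codon.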



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2061 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 64..1698

(D) OTHER INFORMATION: /note= "Maize protox-2 cDNA;
sequence from pWDC-3"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

CTCTCCTACC TCCACCTCCA CGACAACAAG CAAATCCCCA TCCAGTTCCA AACCCTAACT 60 

CAA ATG CTC GCT TTG ACT GCC TCA GCC TCA TCC GCT TCG TCC CAT CCT 108
Met Leu Ala Leu Thr Ala Ser Ala Ser Ser Ala Ser Ser His Pro
1 5 10 15

TAT CGC CAC GCC TCC GCG CAC ACT CGT CGC CCC CGC CTA CGT GCG GTC 156
Tyr Arg His Ala Ser Ala His Thr Arg Arg Pro Arg Leu Arg Ala Val
20 25 30

CTC GCG ATG GCG GGC TCC GAC GAC CCC CGT GCA GCG CCC GCC AGA TCG 204
Leu Ala Met Ala Gly Ser Asp Asp Pro Arg Ala Ala Pro Ala Arg Ser
35 40 45

GTC GCC GTC GTC GGC GCC GGG GTC AGC GGG CTC GCG GCG GCG TAC AGG 252
Val Ala Val Val Gly Ala Gly Val Ser Gly Leu Ala Ala Ala Tyr Arg
50 55 60

CTC AGA CAG AGC GGC GTG AAC GTA ACG GTG TTC GAA GCG GCC GAC AGG 300
Leu Arg Gln Ser Gly Val Asn Val Thr Val Phe Glu Ala Ala Asp Arg
65 70 75

GCG GGA GGA AAG ATA CGG ACC AAT TCC GAG GGC GGG TTT GTC TGG GAT 348
Ala Gly Gly Lys Ile Arg Thr Asn Ser Glu Gly Gly Phe Val Trp Asp
80 85 90 95

GAA GGA GCT AAC ACC ATG ACA GAA GGT GAA TGG GAG GCC AGT AGA CTG 396
Glu Gly Ala Asn Thr Met Thr Glu Gly Glu Trp Glu Ala Ser Arg Leu
100 105 110

ATT GAT GAT CTT GGT CTA CAA GAC AAA CAG CAG TAT CCT AAC TCC CAA 444
Ile Asp Asp Leu Gly Leu Gln Asp Lys Gln Gln Tyr Pro Asn Ser Gln
115 120 125

CAC AAG CGT TAC ATT GTC AAA GAT GGA GCA CCA GCA CTG ATT CCT TCG 492
His Lys Arg Tyr Ile Val Lys Asp Gly Ala Pro Ala Leu Ile Pro Ser
130 135 140

GAT CCC ATT TCG CTA ATG AAA AGC AGT GTT CTT TCG ACA AAA TCA AAG 540
Asp Pro Ile Ser Leu Met Lys Ser Ser Val Leu Ser Thr Lys Ser Lys
145 150 155

ATT GCG TTA TTT TTT GAA CCA TTT CTC TAC AAG AAA GCT AAC ACA AGA 588
Ile Ala Leu Phe Phe Glu Pro Phe Leu Tyr Lys Lys Ala Asn Thr Arg
160 165 170 175

AAC TCT GGA AAA GTG TCT GAG GAG CAC TTG AGT GAG AGT GTT GGG AGC 636
Asn Ser Gly Lys Val Ser Glu Glu His Leu Ser Glu Ser Val Gly Ser
180 185 190

TTC TGT GAA CGC CAC TTT GGA AGA GAA GTT GTT GAC TAT TTT GTT GAT 684
Phe Cys Glu Arg His Phe Gly Arg Glu Val Val Asp Tyr Phe Val Asp
195 200 205

CCA TTT GTA GCT GGA ACA AGT GCA GGA GAT CCA GAG TCA CTA TCT ATT 732
Pro Phe Val Ala Gly Thr Ser Ala Gly Asp Pro Glu Ser Leu Ser Ile
210 215 220

CGT CAT GCA TTC CCA GCA TTG TGG AAT TTG GAA AGA AAG TAT GGT TCA 780
Arg His Ala Phe Pro Ala Leu Trp Asn Leu Glu Arg Lys Tyr Gly Ser
225 230 235

GTT ATT GTT GGT GCC ATC TTG TCT AAG CTA GCA GCT AAA GGT GAT CCA 828
Val Ile Val Gly Ala Ile Leu Ser Lys Leu Ala Ala Lys Gly Asp Pro
240 245 250 255

GTA AAG ACA AGA CAT GAT TCA TCA GGG AAA AGA AGG AAT AGA CGA GTG 876
Val Lys Thr Arg His Asp Ser Ser Gly Lys Arg Arg Asn Arg Arg Val
260 265 270
TCG TTT TCA TTT CAT GGT GGA ATG CAG TCA CTA ATA AAT GCA CTT CAC 924
Ser Phe Ser Phe His Gly Gly Met Gln Ser Leu Ile Asn Ala Leu His
275 280 285

AAT GAA GTT GGA GAT GAT AAT GTG AAG CTT GGT ACA GAA GTG TTG TCA 972
Asn Glu Val Gly Asp Asp Asn Val Lys Leu Gly Thr Glu Val Leu Ser
290 295 300

TTG GCA TGT ACA TTT GAT GGA GTT CCT GCA CTA GGC AGG TGG TCA ATT 1020
Leu Ala Cys Thr Phe Asp Gly Val Pro Ala Leu Gly Arg Trp Ser Ile
305 310 315

TCT GTT GAT TCG AAG GAT AGC GGT GAC AAG GAC CTT GCT AGT AAC CAA 1068
Ser Val Asp Ser Lys Asp Ser Gly Asp Lys Asp Leu Ala Ser Asn Gln
320 325 330 335

ACC TTT GAT GCT GTT ATA ATG ACA GCT CCA TTG TCA AAT GTC CGG AGG 1116
Thr Phe Asp Ala Val Ile Met Thr Ala Pro Leu Ser Asn Val Arg Arg
340 345 350

ATG AAG TTC ACC AAA GGT GGA GCT CCG GTT GTT CTT GAC TTT CTT CCT 1164
Met Lys Phe Thr Lys Gly Gly Ala Pro Val Val Leu Asp Phe Leu Pro
355 360 365

AAG ATG GAT TAT CTA CCA CTA TCT CTC ATG GTG ACT GCT TTT AAG AAG 1212
Lys Met Asp Tyr Leu Pro Leu Ser Leu Met Val Thr Ala Phe Lys Lys
370 375 380

GAT GAT GTC AAG AAA CCT CTG GAA GGA TTT GGG GTC TTA ATA CCT TAC 1260
Asp Asp Val Lys Lys Pro Leu Glu Gly Phe Gly Val Leu Ile Pro Tyr
385 390 395

AAG GAA CAG CAA AAA CAT GGT CTG AAA ACC CTT GGG ACT CTC TTT TCC 1308
Lys Glu Gln Gln Lys His Gly Leu Lys Thr Leu Gly Thr Leu Phe Ser
400 405 410 415

TCA ATG ATG TTC CCA GAT CGA GCT CCT GAT GAC CAA TAT TTA TAT ACA 1356
Ser Met Met Phe Pro Asp Arg Ala Pro Asp Asp Gln Tyr Leu Tyr Thr
420 425 430

ACA TTT GTT GGG GGT AGC CAC AAT AGA GAT CTT GCT GGA GCT CCA ACG 1404
Thr Phe Val Gly Gly Ser His Asn Arg Asp Leu Ala Gly Ala Pro Thr
435 440 445

TCT ATT CTG AAA CAA CTT GTG ACC TCT GAC CTT AAA AAA CTC TTG GGC 1452
Ser Ile Leu Lys Gln Leu Val Thr Ser Asp Leu Lys Lys Leu Leu Gly
450 455 460

GTA GAG GGG CAA CCA ACT TTT GTC AAG CAT GTA TAC TGG GGA AAT GCT 1500
Val Glu Gly Gln Pro Thr Phe Val Lys His Val Tyr Trp Gly Asn Ala
465 470 475
TTT CCT TTG TAT GGC CAT GAT TAT AGT TCT GTA TTG GAA GCT ATA GAA 1548
Phe Pro Leu Tyr Gly His Asp Tyr Ser Ser Val Leu Glu Ala Ile Glu
480 485 490 495

AAG ATG GAG AAA AAC CTT CCA GGG TTC TTC TAC GCA GGA AAT AGC AAG 1596
Lys Met Glu Lys Asn Leu Pro Gly Phe Phe Tyr Ala Gly Asn Ser Lys
500 505 510

GAT GGG CTT GCT GTT GGA AGT GTT ATA GCT TCA GGA AGC AAG GCT GCT 1644
Asp Gly Leu Ala Val Gly Ser Val Ile Ala Ser Gly Ser Lys Ala Ala
515 520 525

GAC CTT GCA ATC TCA TAT CTT GAA TCT CAC ACC AAG CAT AAT AAT TCA 1692
Asp Leu Ala Ile Ser Tyr Leu Glu Ser His Thr Lys His Asn Asn Ser
530 535 540

CAT TGAAAGTGTC TGACCTATCC TCTAGCAGTT GTCGACAAAT TTCTCCAGTT 1745
His
545

CATGTACAGT AGAAACCGAT GCGTTGCAGT TTCAGAACAT CTTCACTTCT TCAGATATTA 1805
ACCCTTCGTT GAACATCCAC CAGAAAGGTA GTCACATGTG TAAGTGGGAA AATGAGGTTA 1865
AAAACTATTA TGGCGGCCGA AATGTTCCTT TTTCTTTTCC TCACAAGTGG CCTACGACAC 1925
TTGATGTTGG AAATACATTT AAATTTGTTG AATTGTTTGA GAACACATGC GTGACGTGTA 1985
ATATTTGCCT ATTGTGATTT TAGCAGTAGT CTTGGCCAGA TTATGCTTTA CGCCTTTAAA 2045
AAAAAAAAAA AAAAAA 2061


(2) INFORMATION FOR SEQ ID NO: 8: 




(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 544 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 




(ii) MOLECULE TYPE: protein 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Met Leu Ala Leu Thr Ala Ser Ala Ser Ser Ala Ser Ser His Pro Tyr
1 5 10 15

Arg His Ala Ser Ala His Thr Arg Arg Pro Arg Leu Arg Ala Val Leu
20 25 30

Ala Met Ala Gly Ser Asp Asp Pro Arg Ala Ala Pro Ala Arg Ser Val
35 40 45

Ala Val Val Gly Ala Gly Val Ser Gly Leu Ala Ala Ala Tyr Arg Leu
50 55 60

Arg Gln Ser Gly Val Asn Val Thr Val Phe Glu Ala Ala Asp Arg Ala
65 70 75 80

Gly Gly Lys Ile Arg Thr Asn Ser Glu Gly Gly Phe Val Trp Asp Glu
85 90 95

Gly Ala Asn Thr Met Thr Glu Gly Glu Trp Glu Ala Ser Arg Leu Ile
100 105 110

Asp Asp Leu Gly Leu Gln Asp Lys Gln Gln Tyr Pro Asn Ser Gln His
115 120 125

Lys Arg Tyr Ile Val Lys Asp Gly Ala Pro Ala Leu Ile Pro Ser Asp
130 135 140

Pro Ile Ser Leu Met Lys Ser Ser Val Leu Ser Thr Lys Ser Lys Ile
145 150 155 160

Ala Leu Phe Phe Glu Pro Phe Leu Tyr Lys Lys Ala Asn Thr Arg Asn
165 170 175

Ser Gly Lys Val Ser Glu Glu His Leu Ser Glu Ser Val Gly Ser Phe
180 185 190

Cys Glu Arg His Phe Gly Arg Glu Val Val Asp Tyr Phe Val Asp Pro
195 200 205

Phe Val Ala Gly Thr Ser Ala Gly Asp Pro Glu Ser Leu Ser Ile Arg
210 215 220

His Ala Phe Pro Ala Leu Trp Asn Leu Glu Arg Lys Tyr Gly Ser Val
225 230 235 240

Ile Val Gly Ala Ile Leu Ser Lys Leu Ala Ala Lys Gly Asp Pro Val
245 250 255

Lys Thr Arg His Asp Ser Ser Gly Lys Arg Arg Asn Arg Arg Val Ser
260 265 270

Phe Ser Phe His Gly Gly Met Gln Ser Leu Ile Asn Ala Leu His Asn
275 280 285

Glu Val Gly Asp Asp Asn Val Lys Leu Gly Thr Glu Val Leu Ser Leu
290 295 300

Ala Cys Thr Phe Asp Gly Val Pro Ala Leu Gly Arg Trp Ser Ile Ser
305 310 315 320

Val Asp Ser Lys Asp Ser Gly Asp Lys Asp Leu Ala Ser Asn Gln Thr
325 330 335

Phe Asp Ala Val Ile Met Thr Ala Pro Leu Ser Asn Val Arg Arg Met
340 345 350

Lys Phe Thr Lys Gly Gly Ala Pro Val Val Leu Asp Phe Leu Pro Lys
355 360 365

Met Asp Tyr Leu Pro Leu Ser Leu Met Val Thr Ala Phe Lys Lys Asp
370 375 380

Asp Val Lys Lys Pro Leu Glu Gly Phe Gly Val Leu Ile Pro Tyr Lys
385 390 395 400

Glu Gln Gln Lys His Gly Leu Lys Thr Leu Gly Thr Leu Phe Ser Ser
405 410 415

Met Met Phe Pro Asp Arg Ala Pro Asp Asp Gln Tyr Leu Tyr Thr Thr
420 425 430

Phe Val Gly Gly Ser His Asn Arg Asp Leu Ala Gly Ala Pro Thr Ser
435 440 445

Ile Leu Lys Gln Leu Val Thr Ser Asp Leu Lys Lys Leu Leu Gly Val
450 455 460

Glu Gly Gln Pro Thr Phe Val Lys His Val Tyr Trp Gly Asn Ala Phe
465 470 475 480

Pro Leu Tyr Gly His Asp Tyr Ser Ser Val Leu Glu Ala Ile Glu Lys
485 490 495

Met Glu Lys Asn Leu Pro Gly Phe Phe Tyr Ala Gly Asn Ser Lys Asp
500 505 510

Gly Leu Ala Val Gly Ser Val Ile Ala Ser Gly Ser Lys Ala Ala Asp
515 520 525

Leu Ala Ile Ser Tyr Leu Glu Ser His Thr Lys His Asn Asn Ser His
530 535 540
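
The pairing of each cDNA listing with its protein listing follows the standard genetic code, so a translation line that is hard to read can be checked against its codons. The following is a minimal sketch, not part of the original listing; the names `CODON_TABLE`, `THREE_LETTER` and `translate` are hypothetical, and the standard (nuclear) genetic code is assumed. The example input is the first codon row of the SEQ ID NO: 7 coding region.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# One-letter to three-letter residue names, as used in the listing.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def translate(nucleotides: str) -> list:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = nucleotides.replace(" ", "").upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the open reading frame
            break
        residues.append(THREE_LETTER[aa])
    return residues


if __name__ == "__main__":
    row = "ATG CTC GCT TTG ACT GCC TCA GCC TCA TCC GCT TCG TCC CAT CCT"
    print(" ".join(translate(row)))
```

Comparing the output with the printed translation row is one way to confirm residue names such as Gln and Ile where the scanned text is ambiguous.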



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1697 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 29..1501

(D) OTHER INFORMATION: /note= "yeast protox-3 cDNA;
sequence from pWDC-5"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:




TTGGCATTTG CCTTGAACCA ACAATTCT ATG TCA ATT GCA ATT TGT GGA GGA 52
Met Ser Ile Ala Ile Cys Gly Gly
1 5

GGT ATA GCT GGT CTT AGT ACA GCA TTT TAT CTT GCT AGA TTG ATT CCA 100
Gly Ile Ala Gly Leu Ser Thr Ala Phe Tyr Leu Ala Arg Leu Ile Pro
10 15 20

AAA TGT ACT ATT GAT TTG TAC GAA AAA GGT CCT CGT TTA GGT GGA TGG 148
Lys Cys Thr Ile Asp Leu Tyr Glu Lys Gly Pro Arg Leu Gly Gly Trp
25 30 35 40

CTT CAG TCG GTC AAA ATC CCG TGT GCA GAT TCT CCA ACA GGA ACG GTT 196
Leu Gln Ser Val Lys Ile Pro Cys Ala Asp Ser Pro Thr Gly Thr Val
45 50 55

TTG TTT GAG CAA GGT CCT AGA ACT CTT CGT CCT GCT GGG GTT GCT GGC 244
Leu Phe Glu Gln Gly Pro Arg Thr Leu Arg Pro Ala Gly Val Ala Gly
60 65 70

TTA GCA AAC TTA GAT TTA ATT AGC AAG TTG GGC ATC GAA GAC AAG TTG 292
Leu Ala Asn Leu Asp Leu Ile Ser Lys Leu Gly Ile Glu Asp Lys Leu
75 80 85

Leu Arg Ile Ser Ser Asn Ser Pro Ser Ala Lys Asn Arg Tyr Ile Tyr
90 95 100

TAC CCA GAT CGC TTA AAT GAA ATT CCT TCA AGC ATT TTA GGG AGT ATA 388
Tyr Pro Asp Arg Leu Asn Glu Ile Pro Ser Ser Ile Leu Gly Ser Ile
105 110 115 120

AAG TCG ATT ATG CAG CCT GCT TTG CGT CCG ATG CCT TTG GCT ATG ATG 436
Lys Ser Ile Met Gln Pro Ala Leu Arg Pro Met Pro Leu Ala Met Met
125 130 135

CTT GAG CCC TTT CGT AAA AGT AAG CGA GAT TCG ACA GAT GAA AGC GTG 484
Leu Glu Pro Phe Arg Lys Ser Lys Arg Asp Ser Thr Asp Glu Ser Val
140 145 150

GGT TCA TTT ATG AGA AGA AGA TTT GGT AAA AAC GTT ACG GAT AGA GTT 532
Gly Ser Phe Met Arg Arg Arg Phe Gly Lys Asn Val Thr Asp Arg Val
155 160 165

ATG AGT GCA ATG ATA AAT GGT ATT TAT GCT GGT GAT TTG AAT GAT TTG 580
Met Ser Ala Met Ile Asn Gly Ile Tyr Ala Gly Asp Leu Asn Asp Leu
170 175 180

TCT ATG CAT TCT AGC ATG TTT GGA TTT TTA GCG AAG ATT GAA AAA AAG 628
Ser Met His Ser Ser Met Phe Gly Phe Leu Ala Lys Ile Glu Lys Lys
185 190 195 200

TAT GGA AAC ATT ACT TTG GGA TTA ATT AGA GCT CTT CTT GCA CGT GAA 676
Tyr Gly Asn Ile Thr Leu Gly Leu Ile Arg Ala Leu Leu Ala Arg Glu
205 210 215

ATA TTA TCT CCT GCT GAG AAA GCT TTG GAA AGC AGC ACT ACT CGC AGA 724
Ile Leu Ser Pro Ala Glu Lys Ala Leu Glu Ser Ser Thr Thr Arg Arg
220 225 230

GCC AAA AAC AGC AGA GCT GTC AAA CAG TAT GAA ATC GAC AAG TAT GTT 772
Ala Lys Asn Ser Arg Ala Val Lys Gln Tyr Glu Ile Asp Lys Tyr Val
235 240 245

GCT TTC AAG GAA GGG ATT GAG ACT ATT ACA TTG TCA ATA GCA GAT GAA 820
Ala Phe Lys Glu Gly Ile Glu Thr Ile Thr Leu Ser Ile Ala Asp Glu
250 255 260

TTA AAA AAA ATG CCG AAT GTC AAG ATA CAT CTA AAC AAA CCG GCC CAA 868
Leu Lys Lys Met Pro Asn Val Lys Ile His Leu Asn Lys Pro Ala Gln
265 270 275 280

ACT TTG GTT CCA CAT AAA ACT CAG TCT CTT GTA GAC GTC AAT GGT CAA 916
Thr Leu Val Pro His Lys Thr Gln Ser Leu Val Asp Val Asn Gly Gln
285 290 295

bcl 1AL UKb 1A1 111 blUi 111 JUl AML ILL 1L1 LUC AM HA UM Ml 964
Ala Tyr Glu Tyr Val Val Phe Ala Asn Ser Ser Arg Asn Leu Glu Asn
300 305 310

CTA ATA TCT TGT CCT AAA ATG GAA ACT CCG ACG TCG AGT GTT TAT GTC 1012
Leu Ile Ser Cys Pro Lys Met Glu Thr Pro Thr Ser Ser Val Tyr Val
315 320 325

GTC AAC GTT TAT TAT AAG GAC CCT AAT GTT CTT CCA ATC CGT GGT TTT 1060
Val Asn Val Tyr Tyr Lys Asp Pro Asn Val Leu Pro Ile Arg Gly Phe
330 335 340

GGG CTT TTG ATT CCA TCA TGC ACT CCA AAT AAT CCG AAT CAT GTT CTT 1108
Gly Leu Leu Ile Pro Ser Cys Thr Pro Asn Asn Pro Asn His Val Leu
345 350 355 360

GGT ATC GTT TTT GAT AGT GAG CAA AAC AAC CCT GAA AAT GGA AGC AAG 1156
Gly Ile Val Phe Asp Ser Glu Gln Asn Asn Pro Glu Asn Gly Ser Lys
365 370 375

GTC ACT GTC ATG ATG GGA GGG TCT GCT TAT ACA AAA AAT ACT TCT TTG 1204
Val Thr Val Met Met Gly Gly Ser Ala Tyr Thr Lys Asn Thr Ser Leu
380 385 390

ATT CCA ACC AAC CCC GAA GAA GCC GTT AAC AAT GCT CTC AAA GCT TTG 1252
Ile Pro Thr Asn Pro Glu Glu Ala Val Asn Asn Ala Leu Lys Ala Leu
395 400 405

CAG CAT ACT TTA AAA ATA TCC AGT AAG CCA ACA CTC ACG AAT GCA ACA 1300
Gln His Thr Leu Lys Ile Ser Ser Lys Pro Thr Leu Thr Asn Ala Thr
410 415 420

TTA CAA CCA AAT TGC ATC CCT CAA TAT CGT GTT GGG CAT CAA GAT AAT 1348
Leu Gln Pro Asn Cys Ile Pro Gln Tyr Arg Val Gly His Gln Asp Asn
425 430 435 440

CTT AAT TCT TTG AAA TCT TGG ATT GAG AAA AAT ATG GGA GGG CGA ATT 1396
Leu Asn Ser Leu Lys Ser Trp Ile Glu Lys Asn Met Gly Gly Arg Ile
445 450 455

CTT CTA ACT GGA AGT TGG TAT AAT GGT GTT AGT ATT GGG GAT TGT ATT 1444
Leu Leu Thr Gly Ser Trp Tyr Asn Gly Val Ser Ile Gly Asp Cys Ile
460 465 470

ATG AAT GGA CAT TCA ACA GCT CGA AAA CTA GCA TCA TTG ATG AAT TCT 1492
Met Asn Gly His Ser Thr Ala Arg Lys Leu Ala Ser Leu Met Asn Ser
475 480 485

TCT TCT TGAGCGTTTA TAAATGTTGA TATAAAATTA GTATATAGTT CCTTTGATTA 1548
Ser Ser
490

HUAIUMLI 1UMXAA1ULL ALIIUIUMM IJUUlllCHUl i SltflCOCOWF TMWMPiOMlftO iiJC 

GTATATGCGA GGACATTCGA CAAACGTTTG AAATTAAAAA TCATATGCCT TTTAGCTTAA 1668 

GACATCAAGG TCATGAATAA TAAAATTTT 1697 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 490 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Met Ser Ile Ala Ile Cys Gly Gly Gly Ile Ala Gly Leu Ser Thr Ala
1 5 10 15

Phe Tyr Leu Ala Arg Leu Ile Pro Lys Cys Thr Ile Asp Leu Tyr Glu
20 25 30

Lys Gly Pro Arg Leu Gly Gly Trp Leu Gln Ser Val Lys Ile Pro Cys
35 40 45

Ala Asp Ser Pro Thr Gly Thr Val Leu Phe Glu Gln Gly Pro Arg Thr
50 55 60

Leu Arg Pro Ala Gly Val Ala Gly Leu Ala Asn Leu Asp Leu Ile Ser
65 70 75 80

Lys Leu Gly Ile Glu Asp Lys Leu Leu Arg Ile Ser Ser Asn Ser Pro
85 90 95

Ser Ala Lys Asn Arg Tyr Ile Tyr Tyr Pro Asp Arg Leu Asn Glu Ile
100 105 110

Pro Ser Ser Ile Leu Gly Ser Ile Lys Ser Ile Met Gln Pro Ala Leu
115 120 125

Arg Pro Met Pro Leu Ala Met Met Leu Glu Pro Phe Arg Lys Ser Lys
130 135 140

Arg Asp Ser Thr Asp Glu Ser Val Gly Ser Phe Met Arg Arg Arg Phe
145 150 155 160

Gly Lys Asn Val Thr Asp Arg Val Met Ser Ala Met Ile Asn Gly Ile
165 170 175

Tyr Ala Gly Asp Leu Asn Asp Leu Ser Met His Ser Ser Met Phe Gly
180 185 190

Phe Leu Ala Lys Ile Glu Lys Lys Tyr Gly Asn Ile Thr Leu Gly Leu
195 200 205

Ile Arg Ala Leu Leu Ala Arg Glu Ile Leu Ser Pro Ala Glu Lys Ala
210 215 220

Leu Glu Ser Ser Thr Thr Arg Arg Ala Lys Asn Ser Arg Ala Val Lys
225 230 235 240

Gln Tyr Glu Ile Asp Lys Tyr Val Ala Phe Lys Glu Gly Ile Glu Thr
245 250 255

Ile Thr Leu Ser Ile Ala Asp Glu Leu Lys Lys Met Pro Asn Val Lys
260 265 270

Ile His Leu Asn Lys Pro Ala Gln Thr Leu Val Pro His Lys Thr Gln
275 280 285

Ser Leu Val Asp Val Asn Gly Gln Ala Tyr Glu Tyr Val Val Phe Ala
290 295 300

Asn Ser Ser Arg Asn Leu Glu Asn Leu Ile Ser Cys Pro Lys Met Glu
305 310 315 320

Thr Pro Thr Ser Ser Val Tyr Val Val Asn Val Tyr Tyr Lys Asp Pro
325 330 335

Asn Val Leu Pro Ile Arg Gly Phe Gly Leu Leu Ile Pro Ser Cys Thr
340 345 350

Pro Asn Asn Pro Asn His Val Leu Gly Ile Val Phe Asp Ser Glu Gln
355 360 365

Asn Asn Pro Glu Asn Gly Ser Lys Val Thr Val Met Met Gly Gly Ser
370 375 380

Ala Tyr Thr Lys Asn Thr Ser Leu Ile Pro Thr Asn Pro Glu Glu Ala
385 390 395 400

Val Asn Asn Ala Leu Lys Ala Leu Gln His Thr Leu Lys Ile Ser Ser
405 410 415

Lys Pro Thr Leu Thr Asn Ala Thr Leu Gln Pro Asn Cys Ile Pro Gln
420 425 430

Tyr Arg Val Gly His Gln Asp Asn Leu Asn Ser Leu Lys Ser Trp Ile
435 440 445

Glu Lys Asn Met Gly Gly Arg Ile Leu Leu Thr Gly Ser Trp Tyr Asn
450 455 460

Gly Val Ser Ile Gly Asp Cys Ile Met Asn Gly His Ser Thr Ala Arg
465 470 475 480

Lys Leu Ala Ser Leu Met Asn Ser Ser Ser
485 490



(2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 41 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: oligonucleotide used to construct
pCGN1761ENX

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

AATTATGACG TAACGTAGGA ATTAGCGGCC CGCTCTCGAG T 41


(2) INFORMATION FOR SEQ ID NO: 12: 




(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 40 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: oligonucleotide used to construct
pCGN1761ENX

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

AATTACTCGA GAGCGGCCGC GAATTCCTAC GTTACGTCAT 40
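
Oligonucleotides used for vector construction often carry restriction enzyme recognition sites, and these can be located by a plain substring search. The sketch below is illustrative and not part of the original listing; the names `SITES` and `find_sites` are hypothetical, the small enzyme panel is an arbitrary choice, and whether any site found was placed there deliberately is an assumption the listing itself does not confirm. The example input is SEQ ID NO: 12 as printed.

```python
# Recognition sequences (5'->3') of a few common restriction enzymes.
SITES = {
    "EcoRI": "GAATTC",
    "XhoI": "CTCGAG",
    "NotI": "GCGGCCGC",
    "BamHI": "GGATCC",
}


def find_sites(sequence: str, sites=SITES) -> dict:
    """Return 0-based start positions of each recognition site in the sequence."""
    seq = sequence.replace(" ", "").upper()
    hits = {}
    for enzyme, motif in sites.items():
        positions = []
        start = seq.find(motif)
        while start != -1:
            positions.append(start)
            # Step by one base so overlapping occurrences are also reported.
            start = seq.find(motif, start + 1)
        hits[enzyme] = positions
    return hits


if __name__ == "__main__":
    seq_id_12 = "AATTACTCGA GAGCGGCCGC GAATTCCTAC GTTACGTCAT"
    for enzyme, positions in find_sites(seq_id_12).items():
        print(enzyme, positions)
```

The search is done on the printed strand only; palindromic sites such as these read the same on the complementary strand, so a single-strand scan is sufficient for this panel.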


(2) INFORMATION FOR SEQ ID NO: 13: 




(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: primer SON0003 used to construct
pSOG10

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

CTCGGATCCAGCAGATTCGAAGAAGGTACAG 31


(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: primer SON0004 used to construct 
pSOG10 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

ACGGGATCCAACTTCCTAGCTGAAAAATGGG 

31 


(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: primer SON0031 used to construct 
pSOG19 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

CATGAGGGACTGACCACCCGGGGATC 

26 


(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: primer SON0010 used to construct 
pSOG19 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

AGCGGATAACAATTTCACACAGGA 

24 


(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: primer SON0016 used to construct 
pSOG19 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

GCTACCATGGCCACATAGAACACC 

24 


(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: primer SON0017 used to construct 
pSOG19 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

CGAGAGCTCGCACTTCAACCTTG 

23 


(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: primer SON0039 used to construct 
pSOG30 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

CGACATGGTACGTCCTGTAGAAACCCACA 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: primer SON0041 used to construct 
pSOG30 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

ATCGCAAGACCGGCAACAGGATTC 

24 
We claim: 

1. An isolated DNA molecule encoding a protein from a eukaryote having 
protoporphyrinogen oxidase (protox) activity. 

2. The isolated DNA molecule of claim 1 wherein said eukaryote is a higher eukaryote. 

3. The isolated DNA molecule of claim 2 wherein said higher eukaryote is a plant. 

4. The isolated DNA molecule of claim 3, wherein said plant is a dicotyledon. 

5. The isolated DNA molecule of claim 4, wherein said dicotyledon is an Arabidopsis 
species. 

6. The isolated DNA molecule of claim 5, wherein said protein comprises the amino 
acid sequence selected from the group consisting of SEQ ID Nos. 2 and 4. 

7. The isolated DNA molecule of claim 3, wherein said plant is a monocotyledon. 

8. The isolated DNA molecule of claim 7, wherein said monocotyledon is maize. 

9. The isolated DNA molecule of claim 8, wherein said protein comprises the amino 
acid sequence selected from the group consisting of SEQ ID Nos. 6 and 8. 

10. A DNA molecule encoding a modified protoporphyrinogen oxidase (protox) 
comprising a eukaryotic protox having at least one amino acid modification, wherein 
said modified protox is tolerant to a herbicide in amounts which inhibit said eukaryotic 
protox. 

11. The DNA molecule of claim 10 wherein said eukaryotic protox is from a plant. 
12. The DNA molecule of claim 11 wherein said plant is a dicotyledon. 

13. The DNA molecule of claim 12, wherein said dicotyledon is an Arabidopsis species. 

14. The DNA molecule of claim 13, wherein said eukaryotic protox comprises the 
amino acid sequence selected from the group consisting of SEQ ID Nos. 2 and 4. 

15. The DNA molecule of claim 14, wherein said eukaryotic protox comprises the 
amino acid sequence set forth in SEQ ID No. 2. 

16. The DNA molecule of claim 15, wherein said at least one amino acid modification is 
a substitution of an amino acid selected from the group consisting of the alanine at 
position 220, the glycine at position 221 and the tyrosine at position 426 of SEQ ID No. 2. 

17. The DNA molecule of claim 16, wherein said alanine at position 220 is replaced 
with an amino acid selected from the group consisting of valine, threonine, leucine, 
cysteine and tyrosine. 

18. The DNA molecule of claim 16, wherein said glycine at position 221 is replaced 
with a serine. 

19. The DNA molecule of claim 16 wherein said tyrosine at position 426 is replaced 
with an amino acid selected from the group consisting of cysteine, isoleucine, leucine 
and threonine. 



20. The DNA molecule of claim 11, wherein said plant is a monocotyledon. 

21. The DNA molecule of claim 20, wherein said monocotyledon is maize. 
22. The DNA molecule of claim 21, wherein said eukaryotic protox comprises the amino 
acid sequence selected from the group consisting of SEQ ID Nos. 6 and 8. 

23. The DNA molecule of claim 22, wherein said eukaryotic protox comprises the amino 
acid sequence set forth in SEQ ID No. 6. 

24. The DNA molecule of claim 23, wherein said at least one amino acid modification is 
a substitution of an amino acid selected from the group consisting of the alanine at 
position 166, the glycine at position 167 and the tyrosine at position 372 of SEQ ID No. 6. 

25. The DNA molecule of claim 24, wherein said alanine at position 166 is replaced 
with an amino acid selected from the group consisting of valine, threonine, leucine, 
cysteine and tyrosine. 

26. The DNA molecule of claim 24, wherein said glycine at position 167 is replaced 
with a serine. 

27. The DNA molecule of claim 24 wherein said tyrosine at position 372 is replaced 
with an amino acid selected from the group consisting of cysteine, isoleucine, leucine 
and threonine. 
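
The substitutions recited in claims 16 to 19 (SEQ ID No. 2) and claims 24 to 27 (SEQ ID No. 6) can be summarized as a lookup table. The following is a minimal sketch only: the names `CLAIMED_SUBSTITUTIONS`, `is_claimed_substitution` and `apply_substitution` are hypothetical helpers introduced for illustration, and the positions and residues are taken from the claim text, not verified against the sequences themselves.

```python
# One-letter codes: A = alanine, G = glycine, Y = tyrosine, V = valine,
# T = threonine, L = leucine, C = cysteine, S = serine, I = isoleucine.
# Table of {sequence: {position: (original residue, allowed replacements)}},
# transcribed from claims 16-19 and 24-27.
CLAIMED_SUBSTITUTIONS = {
    "SEQ ID No. 2": {
        220: ("A", frozenset("VTLCY")),
        221: ("G", frozenset("S")),
        426: ("Y", frozenset("CILT")),
    },
    "SEQ ID No. 6": {
        166: ("A", frozenset("VTLCY")),
        167: ("G", frozenset("S")),
        372: ("Y", frozenset("CILT")),
    },
}


def is_claimed_substitution(seq_id, position, original, replacement):
    """Return True if the substitution appears in the table above."""
    entry = CLAIMED_SUBSTITUTIONS.get(seq_id, {}).get(position)
    if entry is None:
        return False
    expected_original, replacements = entry
    return original == expected_original and replacement in replacements


def apply_substitution(protein, position, original, replacement):
    """Replace the residue at a 1-based position in a one-letter protein string.

    Raises ValueError if the position is out of range or the residue found
    there is not the expected original residue.
    """
    if not 1 <= position <= len(protein):
        raise ValueError("position outside the sequence")
    if protein[position - 1] != original:
        raise ValueError("unexpected residue at the given position")
    return protein[: position - 1] + replacement + protein[position:]
```

For example, `is_claimed_substitution("SEQ ID No. 2", 220, "A", "V")` corresponds to the alanine-to-valine replacement of claim 17, whereas a glycine-to-alanine change at position 221 is not among the recited substitutions.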

28. The DNA molecule according to any one of claims 10 to 27 which is part of a plant 
genome. 

29. A chimeric gene comprising a promoter operably linked to a heterologous DNA 
molecule encoding a protein from a higher eukaryote having protoporphyrinogen 
oxidase (protox) activity. 

30. The chimeric gene of claim 29 wherein said promoter is active in a plant. 
31. The chimeric gene of claim 30 additionally comprising a signal sequence operably 
linked to said DNA molecule, wherein said signal sequence is capable of targeting the 
protein encoded by said DNA molecule into the chloroplast. 

32. The chimeric gene of claim 30 additionally comprising a signal sequence operably 
linked to said DNA molecule, wherein said signal sequence is capable of targeting the 
protein encoded by said DNA molecule into the mitochondria. 

33. A chimeric gene comprising a promoter which is active in a plant operably linked to 
the DNA molecule of claim 10. 

34. The chimeric gene of claim 33 additionally comprising a signal sequence operably 
linked to said DNA molecule, wherein said signal sequence is capable of targeting the 
protein encoded by said DNA molecule into the chloroplast. 

35. The chimeric gene of claim 34 additionally comprising a signal sequence operably 
linked to said DNA molecule, wherein said signal sequence is capable of targeting the 
protein encoded by said DNA molecule into the mitochondria. 

36. The chimeric gene according to any one of claims 29 to 35 which is part of a plant 
genome. 

37. A recombinant vector comprising the chimeric gene of any one of claims 29 to 35, 
wherein said vector is capable of being stably transformed into a host cell. 

38. A recombinant vector comprising the chimeric gene of claim 33, wherein said 
vector is capable of being stably transformed into a plant cell. 

39. A host cell stably transformed with a vector according to any one of claims 37 or 
38, wherein said host cell is capable of expressing said DNA molecule. 
40. A host cell of claim 39 selected from the group consisting of a plant cell, a 
bacterial cell, a yeast cell, and an insect cell. 

41. A plant or plant cell including the progeny thereof comprising a DNA molecule of 
any one of claims 10 to 28, wherein said DNA molecule is expressed in said plant and 
confers upon said plant and plant cell, respectively, tolerance to a herbicide in amounts 
which inhibit naturally occurring protox activity. 

42. A plant or plant cell including the progeny thereof comprising a chimeric gene of 
any one of claims 29 to 36, wherein said chimeric gene confers upon said plant and 
plant cell, respectively, tolerance to a herbicide in amounts which inhibit naturally 
occurring protox activity. 

43. A plant and its progeny including parts thereof having altered protoporphyrinogen 
oxidase (protox) activity, wherein said altered protox activity confers upon said plant and 
its progeny tolerance to a herbicide in amounts which inhibit naturally occurring protox 
activity. 

44. The plant of any one of claims 41 to 43, wherein said plant is a dicotyledon. 

45. The plant of claim 44, wherein said plant is selected from the group consisting of 
soybean, cotton, tobacco, sugar beet, and oilseed rape. 

46. The plant of any one of claims 41 to 43, wherein said plant is a monocot. 

47. The plant of claim 46, wherein said plant is selected from the group consisting of 
maize, wheat, sorghum, rye, oats, turf grass and rice. 

48. The plant of any one of claims 41 to 43, wherein said altered protox activity is 
conferred by over-expression of a protox enzyme which naturally occurs in said plant. 
49. The plant of any one of claims 41 to 43, wherein said altered protox activity is 
conferred by expression of a DNA molecule encoding a herbicide tolerant protox 
enzyme. 

50. The plant of claim 49, wherein said herbicide tolerant protox enzyme naturally 
occurs in a prokaryote. 

51. The plant of claim 50 wherein said prokaryote is selected from the group consisting 
of E. coli, B. subtilis and S. typhimurium. 

52. The plant of claim 49 wherein said herbicide tolerant protox enzyme is a modified 
form of a protein which naturally occurs in a prokaryote. 

53. The seed of a plant according to any one of claims 41 to 52. 

54. A plant according to any one of claims 41 to 52, which is a hybrid plant. 

55. Propagating material of a plant according to any one of claims 41 to 54 treated with a 
protectant coating. 

56. Propagating material according to claim 55, comprising a preparation selected from the 
group consisting of herbicides, insecticides, fungicides, bactericides, nematicides, 
molluscicides or mixtures thereof. 

57. Propagating material according to claim 55 or 56, characterized in that it consists of 
seed. 

58. A method for controlling the growth of undesired vegetation which comprises 
applying to a population of the plant of any one of claims 41 to 54 an effective amount 
of a protox-inhibiting herbicide. 
59. The method of claim 58 wherein said plant is selected from the group consisting of 
soybean, cotton, tobacco, sugar beet, oilseed rape, maize, wheat, sorghum, rye, oats, 
turf grasses and rice. 

60. The method of claim 59 wherein said protox-inhibiting herbicide is selected from 
the group consisting of an aryluracil, a diphenylether, an oxadiazole, an imide, a phenyl 
pyrazole, a pyridine derivative, a 3-substituted-2-aryl-4,5,6,7-tetrahydroindazole, a 
phenopylate and O-phenylpyrrolidino- and piperidinocarbamate analogs of said 
phenopylate. 

61. The method of claim 60 wherein said protox-inhibiting herbicide is an imide having 
the formula 

(Formula V) 

wherein Q equals 

(Formula VI), (Formula VII), (Formula VIII), (Formula IX), (Formula IXa) or (Formula IXb) 

and wherein R1 equals H, Cl or F, R2 equals Cl and R3 is an optionally substituted ether, 
thioether, ester, amino or alkyl group, and wherein R2 and R3 together may form a 5 or 
6 membered heterocyclic ring. 

62. The method of claim 61 wherein said imide is selected from the group consisting of 

(Formula X); 

(Formula XI); 

(Formula XII); 

(Formula XIII); 

(Formula XIV); 

(Formula XV); 

(Formula XVI); and 

(Formula XVII) 

wherein R signifies (C2-6-alkenyloxy)carbonyl-C1-4-alkyl. 



63. The method of claim 58 wherein said protox-inhibiting herbicide has the formula 
selected from the group consisting of 

(Formula XVIII), 

(Formula XIX), 

(Formula XX). 

64. A method for assaying a chemical for the ability to inhibit the activity of a protox 
enzyme from a plant comprising 

(a) combining said protox enzyme and protoporphyrinogen IX in a first reaction 
mixture under conditions in which said protox enzyme is capable of catalyzing the 
conversion of said protoporphyrinogen IX to protoporphyrin IX; 
(b) combining said chemical, said protox enzyme and protoporphyrinogen IX in a 
second reaction mixture under the same conditions as in said first reaction mixture; 

(c) exciting said first and said second reaction mixtures at about 395 to about 410 
nm; 

(d) comparing the fluorescence of said first and said second reaction mixtures at 
about 622 to about 635 nm; 

wherein said chemical is capable of inhibiting the activity of said protox enzyme if the 
fluorescence of said second reaction mixture is significantly less than the fluorescence 
of said first reaction mixture. 
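
The comparison in step (d) of claim 64 can be illustrated with a short calculation. This is a sketch under stated assumptions, not the assay protocol itself: the claim does not quantify "significantly less", so the 50 percent cutoff below is a hypothetical placeholder, and the function names are introduced here for illustration only. The wavelength windows are the approximate ranges given in steps (c) and (d).

```python
# Approximate wavelength windows from claim 64, in nanometres.
EXCITATION_WINDOW_NM = (395.0, 410.0)
EMISSION_WINDOW_NM = (622.0, 635.0)


def in_window(wavelength_nm, window):
    """Return True if a wavelength lies inside the (low, high) window."""
    low, high = window
    return low <= wavelength_nm <= high


def percent_inhibition(control_fluorescence, treated_fluorescence):
    """Percent drop in fluorescence of the second (chemical-containing)
    reaction mixture relative to the first (control) mixture."""
    if control_fluorescence <= 0:
        raise ValueError("control fluorescence must be positive")
    return 100.0 * (control_fluorescence - treated_fluorescence) / control_fluorescence


def is_inhibitor(control_fluorescence, treated_fluorescence, cutoff_percent=50.0):
    """Flag the chemical as an inhibitor when the drop reaches the cutoff.

    cutoff_percent is an assumed value; the claim only says
    'significantly less'.
    """
    return percent_inhibition(control_fluorescence, treated_fluorescence) >= cutoff_percent
```

With a control reading of 1000 units and a treated reading of 250 units, `percent_inhibition` gives 75, which exceeds the assumed cutoff; a treated reading of 900 units gives 10 and does not. In practice the cutoff would be set from replicate measurements.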

65. A method of identifying a modified protox enzyme resistant to a protox inhibitor 
present in a population of cells comprising the steps of 

(a) culturing said population in the presence of said protox inhibitor in amounts which 
inhibit the unmodified form of said protox enzyme; 

(b) selecting those cells from step (a) whose growth is not inhibited; and 

(c) isolating and identifying the protox enzyme present in the cells selected from step 
(b). 

66. A method of selecting plants, plant tissue or plant cells transformed with a 
transgene of interest from non-transformed plants, comprising the steps of: 

(a) transforming a plant, plant tissue or plant cell with a transgene of interest capable 
of being expressed by the plant, and a gene encoding an altered protox resistant to a 
protox inhibitor; 

(b) transferring the thus-transformed plants or plant cells to a medium comprising the 
protox inhibitor; and 

(c) selecting the plants or plant cells which survive in the medium. 

67. A probe capable of specifically hybridizing to a eukaryotic protoporphyrinogen 
oxidase gene or mRNA, wherein said probe comprises a contiguous portion of the 
coding sequence for a protoporphyrinogen oxidase from a eukaryote at least 10 
nucleotides in length. 

68. The probe of claim 67 wherein said coding sequence is selected from the group 
consisting of SEQ ID Nos. 1, 3, 5, 7 and 9. 
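
The length criterion of claim 67 (a contiguous portion of the coding sequence at least 10 nucleotides long) can be checked mechanically. The sketch below is illustrative only: `comprises_contiguous_portion` is a hypothetical helper, it tests exact matches on the given strand, and it says nothing about whether a probe hybridizes specifically, which depends on experimental conditions.

```python
def comprises_contiguous_portion(probe, coding_sequence, min_length=10):
    """Return True if the probe contains at least one stretch of
    min_length consecutive nucleotides that also occurs in the coding
    sequence. Comparison is case-insensitive and exact."""
    probe = probe.upper()
    coding = coding_sequence.upper()
    if len(probe) < min_length or len(coding) < min_length:
        return False
    # Every window of min_length nucleotides present in the coding sequence.
    windows = {
        coding[i : i + min_length]
        for i in range(len(coding) - min_length + 1)
    }
    # The probe qualifies if any of its own windows is among them.
    return any(
        probe[i : i + min_length] in windows
        for i in range(len(probe) - min_length + 1)
    )
```

Any longer shared stretch necessarily contains a shared window of exactly `min_length` nucleotides, so comparing fixed-size windows is sufficient. A probe that carries extra flanking sequence still qualifies as long as one such window matches.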

69. A method of producing a host cell comprising an isolated DNA molecule encoding 
a protein from a eukaryote having protoporphyrinogen oxidase (protox) activity 
comprising transforming the said host cell with a recombinant vector molecule 
according to claim 37 or 38. 

70. A method of producing a plant cell comprising an isolated DNA molecule encoding 
a protein from a eukaryote having protoporphyrinogen oxidase (protox) activity 
comprising transforming the said plant cell with a recombinant vector molecule 
according to claim 37 or 38. 

71. A method of producing transgenic progeny of a transgenic parent plant comprising 
an isolated DNA molecule encoding a protein from a eukaryote having 
protoporphyrinogen oxidase (protox) activity comprising transforming the said parent 
plant with a recombinant vector molecule according to claim 37 or 38 and transferring 
the herbicide tolerant trait to the progeny of the said transgenic parent plant involving 
known plant breeding techniques. 

72. A method of producing a DNA molecule encoding a protein from a eukaryote 
having protoporphyrinogen oxidase (protox) activity comprising 

(a) establishing a cDNA library from a suitable eukaryotic source; 

(b) identifying cDNA clones encoding a protox enzyme based on their ability to supply 
protox enzymatic activity to a mutant host organism deficient in this activity. 

73. A method of producing a DNA molecule encoding a protein from a eukaryote 
having protoporphyrinogen oxidase (protox) activity comprising 

(a) establishing a genomic or a cDNA library from a suitable eukaryotic source; 

(b) probing the said library with a probe molecule according to claim 67. 

74. Use of a DNA molecule according to any one of claims 28 or 36 to confer tolerance 
to a herbicide in amounts which inhibit naturally occurring protox activity from a parent 
plant to its progeny comprising first stably transforming the parent plant with a DNA 
molecule according to any one of claims 10 to 27 by stably incorporating the said DNA 
molecule into the plant genome of the said parent plant and second transferring the 
herbicide tolerant trait to the progeny of the said transgenic parent plant involving 
known plant breeding techniques. 

75. Use of a DNA molecule according to any one of claims 28 or 36 to prepare a 
medical tool for the treatment of deficiencies in protoporphyrinogen oxidase (protox) 
activity in animals, particularly humans. 

76. Use of a DNA molecule according to any one of claims 28 or 36 to prepare a 
medical tool for diagnosing deficiencies in protoporphyrinogen oxidase (protox) activity 
in animals, particularly humans. 

77. A pharmaceutical composition comprising together with a pharmaceutically 
acceptable carrier a protein obtainable from a eukaryote having protoporphyrinogen 
oxidase (protox) activity to be used in a method for treatment of the animal or human body 
or for diagnostic purposes. 